Abstract

We propose a proof of convergence of an adaptive method used in molecular dynamics to compute free energy profiles (see [IllSlIlS]). Mathematically, it amounts to studying the long-time behavior of a stochastic process which satisfies a non-linear stochastic differential equation, where the drift depends on conditional expectations of some functional of the process. We use entropy techniques to prove exponential convergence to the stationary state.

1 Introduction

In Section 1.1, we introduce the physical context of this work, namely molecular dynamics and the computation of free energy differences in the canonical statistical ensemble. In Section 1.2, we introduce the adaptive dynamics we study, and the main results we prove are presented in Section 1.3.


1.1 Computations of free energy differences and metastability 

Let us consider the Gibbs-Boltzmann measure

$$d\mu(q) = Z^{-1} \exp(-\beta V(q))\,dq, \tag{1}$$

where $q \in \mathcal{D}$, $V : \mathcal{D} \to \mathbb{R}$, $Z = \int_{\mathcal{D}} \exp(-\beta V(q))\,dq$ and $\mathcal{D} = \{q,\ V(q) < \infty\}$ is the configuration space. In the applications we consider, $q$ represents the position of $N$ particles so that, in the following, $\mathcal{D}$ is an open subset (possibly the whole) of $\mathbb{R}^n$, with $n = 3N$. All the results we prove also hold if $\mathcal{D}$ is an open subset of $\mathbb{T}^n$ (where $\mathbb{T} = \mathbb{R}/\mathbb{Z}$ denotes the one-dimensional torus). The function $V$ is the energy associated with the positions of the particles and $\beta$ is proportional to the inverse of the temperature. The probability measure $\mu$ represents the equilibrium measure sampled by the particles in the canonical statistical ensemble. A typical dynamics that can be used to sample this measure is

$$dQ_t = -\nabla V(Q_t)\,dt + \sqrt{2\beta^{-1}}\,dB_t, \tag{2}$$

where $B_t$ is an $n$-dimensional standard Brownian motion. More generally, for any smooth positive function $\gamma : \mathcal{D} \to \mathbb{R}_+^*$, the stochastic process $Q_t$ which satisfies

$$dQ_t = -\nabla(V - \beta^{-1}\ln\gamma)(Q_t)\,\gamma(Q_t)\,dt + \sqrt{2\beta^{-1}\gamma(Q_t)}\,dB_t \tag{3}$$

samples the measure $\mu$.
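To make the sampling dynamics (2) concrete, here is a minimal Python sketch of its Euler-Maruyama discretization. The function name `overdamped_langevin`, the time step, the number of steps and the quadratic test potential are assumptions chosen for this illustration; they are not part of the method analyzed in this paper.

```python
import numpy as np

def overdamped_langevin(grad_V, q0, beta, dt, n_steps, rng):
    """Euler-Maruyama scheme for dQ_t = -grad V(Q_t) dt + sqrt(2 / beta) dB_t."""
    q = np.array(q0, dtype=float)
    traj = np.empty((n_steps + 1,) + q.shape)
    traj[0] = q
    noise_scale = np.sqrt(2.0 * dt / beta)   # standard deviation of one Brownian increment
    for k in range(n_steps):
        q = q - grad_V(q) * dt + noise_scale * rng.standard_normal(q.shape)
        traj[k + 1] = q
    return traj

# Hypothetical example: V(q) = |q|^2 / 2 in dimension 1, so that grad V(q) = q.
rng = np.random.default_rng(0)
traj = overdamped_langevin(lambda q: q, np.zeros(1), beta=1.0, dt=0.01,
                           n_steps=20000, rng=rng)
```

For a potential with several wells separated by high barriers, the same scheme would remain trapped in one well for long times, which is the metastability issue discussed in this section.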
Let us introduce a so-called reaction coordinate $\xi : \mathcal{D} \to \mathcal{M}$, with $\mathcal{M} = \mathbb{R}$ or $\mathcal{M} = \mathbb{T}$. For a given configuration $q$, $\xi(q)$ represents coarse-grained information, which is valuable from a physical point of view. For instance, $\xi(q)$ may be a dihedral angle, for example to characterize the conformation of a molecule, in which case $\mathcal{M} = \mathbb{T}$, or the signed distance to a hypersurface of $\mathcal{D}$ (characterizing a transition state), for example to measure the evolution of a chemical reaction, in which case $\mathcal{M} = \mathbb{R}$. The function $\xi$ is therefore related to some macroscopic information of the system. Usually, the time-scale for the dynamics on $\xi(Q_t)$ is larger than the time-scale for the dynamics on $Q_t$ (due to metastable states), so that $\xi$ can also be understood as a function such that $\xi(Q_t)$ is a slow variable compared to $Q_t$.

In the following, we suppose that

[H1] $\xi$ is a smooth function such that $|\nabla\xi| > 0$ on $\mathcal{D}$.

Thus, the subsets $\Sigma_z = \{x \in \mathcal{D},\ \xi(x) = z\}$ of $\mathcal{D}$ are smooth submanifolds of codimension one which define a partition of $\mathcal{D}$:

$$\mathcal{D} = \bigcup_{z \in \mathcal{M}} \Sigma_z \quad\text{and}\quad \Sigma_z \cap \Sigma_{z'} = \emptyset \ \text{ for } z \neq z'.$$

We denote by $\sigma_{\Sigma_z}$ the surface measure on $\Sigma_z$, i.e. the Lebesgue measure on $\Sigma_z$ induced by the Lebesgue measure in the ambient space $\mathcal{D} \supset \Sigma_z$. The submanifold $\Sigma_z$ naturally has a (complete and locally compact) Riemannian structure induced by the Euclidean structure of the ambient space $\mathcal{D}$.

The image of the measure $\mu$ by $\xi$ is $Z^{-1}\exp(-\beta A(z))\,dz$, where $A$ is the so-called free energy defined by:

$$A(z) = -\beta^{-1} \ln(Z_{\Sigma_z}), \tag{4}$$

where

$$Z_{\Sigma_z} = \int_{\Sigma_z} |\nabla\xi|^{-1} \exp(-\beta V)\,d\sigma_{\Sigma_z}.$$

We assume henceforth that $\xi$ and $V$ are such that $Z_{\Sigma_z} < \infty$. The free energy is actually defined up to an additive constant, the quantity $\exp(-\beta A)$ being then defined up to a multiplicative constant, which disappears in the normalization of the probability measure $\frac{\exp(-\beta A(z))\,dz}{\int_{\mathcal{M}} \exp(-\beta A(z))\,dz}$. Many algorithms in molecular dynamics [5] aim to compute the image of the measure $\mu$ by $\xi$, which amounts to computing free energy differences, namely quantities of the form $A(z) - A(z_0)$. This is typically obtained by computing (and then integrating) the derivative $A'(z)$, called the mean force. Using the co-area formula (see Appendix A), the following expression for $A'(z)$ can be obtained (see jSj, or the proof of Lemma 7 below):



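$$A'(z) = Z_{\Sigma_z}^{-1} \int_{\Sigma_z} F\,|\nabla\xi|^{-1} \exp(-\beta V)\,d\sigma_{\Sigma_z}, \tag{5}$$

where $F$ is the so-called local mean force defined by

$$F = \frac{\nabla V \cdot \nabla\xi}{|\nabla\xi|^2} - \beta^{-1}\,\mathrm{div}\left(\frac{\nabla\xi}{|\nabla\xi|^2}\right). \tag{6}$$

This can be rewritten in terms of conditional expectation as: for a random variable $X$ with law $\mu$,

$$A'(z) = \mathbb{E}\left(F(X) \mid \xi(X) = z\right). \tag{7}$$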
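The conditional expectation (7) can be checked by hand on a toy example. The Python sketch below assumes a two-dimensional configuration $(x, y)$ with reaction coordinate $\xi(x, y) = x$, so that $|\nabla\xi| = 1$ and the local mean force (6) reduces to $F = \partial_x V$. The potential, the grid and the function name `mean_force` are hypothetical choices made for this illustration.

```python
import numpy as np

def mean_force(V, dV_dx, z, y_grid, beta):
    """Approximate A'(z) = E(F(X) | xi(X) = z) for xi(x, y) = x, where F = dV/dx."""
    # Unnormalized density of the Gibbs measure restricted to the line {x = z}.
    weights = np.exp(-beta * V(z, y_grid))
    return float(np.sum(dV_dx(z, y_grid) * weights) / np.sum(weights))

# Hypothetical test potential: a double well in x coupled to a harmonic well in y.
V = lambda x, y: (x**2 - 1.0)**2 + 0.5 * (1.0 + x**2) * y**2
dV_dx = lambda x, y: 4.0 * x * (x**2 - 1.0) + x * y**2

beta, z = 2.0, 0.5
y_grid = np.linspace(-10.0, 10.0, 4001)
numerical = mean_force(V, dV_dx, z, y_grid, beta)

# For this potential the Gaussian integral in y is explicit:
# A'(z) = 4 z (z^2 - 1) + z / (beta (1 + z^2)).
exact = 4.0 * z * (z**2 - 1.0) + z / (beta * (1.0 + z**2))
```

The second term of `exact` is the entropic contribution to the mean force: it comes from the $x$-dependence of the width of the well in $y$, and it would be missed by differentiating only the double-well part of $V$.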



In practice, free energy profiles are used for example to compare the likelihood of various conformations of a molecule, or to compute the rate of a chemical reaction. Free energy can also be useful to compute ensemble averages in the canonical ensemble using the following formula (which is a conditioning formula): for any function $\phi : \mathcal{D} \to \mathbb{R}$,

$$\int_{\mathcal{D}} \phi\,d\mu = \frac{\int_{\mathcal{M}} \left(\int_{\Sigma_z} \phi\,d\mu_{\Sigma_z}\right) \exp(-\beta A(z))\,dz}{\int_{\mathcal{M}} \exp(-\beta A(z))\,dz}, \tag{8}$$

where $\mu_{\Sigma_z}$ is the probability measure $\mu$ conditioned to a fixed value $z$ of the reaction coordinate:

$$d\mu_{\Sigma_z} = Z_{\Sigma_z}^{-1}\,|\nabla\xi|^{-1} \exp(-\beta V)\,d\sigma_{\Sigma_z}. \tag{9}$$

Notice that (5) also writes $A'(z) = \int_{\Sigma_z} F\,d\mu_{\Sigma_z}$. Equation (8) may be interesting to compute averages in the canonical ensemble since, if the reaction coordinate is well chosen, it is expected that the sampling of the conditioned probability measure $\mu_{\Sigma_z}$ is easier than the sampling of $\mu$ (the metastable features of the measure $\mu$ being mostly in the direction of the reaction coordinate $\xi$). The sampling of $\mu_{\Sigma_z}$ can be done for example by projection of the gradient dynamics on $\Sigma_z$ (see [6]). The quantity $\int_{\Sigma_z} \phi\,d\mu_{\Sigma_z}$ can thus be evaluated by an efficient Monte Carlo procedure, and the computation of $\int_{\mathcal{D}} \phi\,d\mu$ through (8) then only requires a one-dimensional integration, and the computation of the free energy (up to an additive constant).

Due to the high dimensionality of the problem (the number of particles N is usually 
very large), methods to compute mean forces or free energy differences are Monte Carlo 
methods. They typically rely on the simulation of a diffusion Markov process. The 
most recent methods use non-homogeneous or non-linear Markov processes. Classical
examples are exponential reweighting of non-equilibrium paths (based upon the so- 
called Jarzynski equality, see |1H 112)) or adaptive methods (see [Tj [9t fTOt I18jl. 

We are interested here in adaptive methods to compute free energy differences, and 
more precisely Adaptive Biasing Force techniques (see [3 [9]). The principle of adaptive 
methods is to modify the potential V during the simulation, in order to remove the 
metastable features of the simple dynamics while approximating the free energy A. 
Many methods have been proposed and we refer to for a unified presentation of 
these techniques, as well as a discussion of efficient parallel implementations. The 
aim of this paper is to propose a mathematical study of the Adaptive Biasing Force 
method to give a rigorous formulation and proofs of the following statements (which 
are the main arguments of practitioners of the field to advocate the use of adaptive 
methods): 

[S1] The adaptive biasing force technique helps to remove the metastable features of the simple dynamics (2), and thus enables efficient exploration of the configuration space.

[S2] With the adaptive biasing force technique, the free energy $A$ is obtained in the longtime limit, and the convergence is exponentially fast in time.



1.2 An Adaptive Biasing Force technique 

The Adaptive Biasing Force (ABF) method was introduced in [Tjl^ and is recast in a 
general mathematical framework in [13]. We propose to study here one version of this method, applied to the context of Brownian (or overdamped Langevin) dynamics.¹

The ABF dynamics we propose to study is the following non-linear stochastic differential equation:

$$dX_t = -\nabla\left(V - A_t\circ\xi + W\circ\xi - \beta^{-1}\ln(|\nabla\xi|^{-2})\right)(X_t)\,|\nabla\xi|^{-2}(X_t)\,dt + \sqrt{2\beta^{-1}}\,|\nabla\xi|^{-1}(X_t)\,dB_t, \tag{10}$$

¹ Such methods can also be applied to other dynamics, like Langevin dynamics. We only consider Brownian dynamics in this paper.

where $W$ is an additional well-chosen potential that we will define below and $A_t$ is the "free energy observed at time $t$". More precisely, the derivative of $A_t$ with respect to the reaction coordinate is defined as (compare with (7)): $\forall z \in \mathcal{M}$,

$$A_t'(z) = \mathbb{E}\left(F(X_t) \mid \xi(X_t) = z\right), \tag{11}$$

where $F$ is defined by (6). With a slight abuse of terminology, the function $A_t'$ is called the biasing force. Notice that here and in the following, the notation $'$ denotes a derivative with respect to the reaction coordinate values, while the notation $\circ$ denotes the composition operator. Equation (11) defines $A_t$ up to an additive (time-dependent) constant, which does not modify (10).

Compared to the simple dynamics (2), three modifications have been made to obtain (10)-(11):

1. First and foremost, the potential $V$ has been changed to the biasing potential $V - A_t\circ\xi$. This is the bottom line of the adaptive strategy. The algorithm we study here is prototypical of many adaptive methods used in molecular dynamics (see [13]). In the original Adaptive Biasing Force technique as presented in [TjEI, the conditional expectation (11) is actually "approximated" by some conditional averages over one single trajectory. The dynamics we study here is not clearly related to such a discretization, but rather to a discretization using an interacting particle system, where many replicas of the system contribute to the free energy profile (see [13]).

2. Second, a potential $W\circ\xi$ has been added. This is actually needed only in the case when $\mathcal{M}$ is an unbounded domain (we recall that $\mathcal{M}$ is the domain where the reaction coordinate lives). In these cases, $W$ is chosen so that the law of $\xi(X_t)$ converges exponentially fast to its longtime limit (more precisely, the Fisher information associated with this law converges exponentially fast to zero, see [H4] below for a more detailed statement). Besides, from a numerical point of view, such a potential is sometimes used in practice in order to separately sample some parts of the reaction coordinate space $\mathcal{M}$ (as in stratified sampling strategies).

3. Third, some terms depending on $|\nabla\xi|$ have been introduced. This modification is made in order to obtain a simple diffusive behavior for the law of $\xi(X_t)$ (see Proposition 1 below). It is expected that the longtime convergence of $A_t'$ towards $A'$ still holds without this modification, by simply considering the gradient dynamics

$$dX_t = -\nabla(V - A_t\circ\xi + W\circ\xi)(X_t)\,dt + \sqrt{2\beta^{-1}}\,dB_t, \tag{12}$$

with the same definition (11) for $A_t'$. However, we are only able to prove a weaker convergence result in this case. This is the matter of Sections 2.3 and 3.4. Notice that if $|\nabla\xi|$ is constant (for example if $\xi$ is a length), a simple change of time relates (12) with (10). Notice also that if we take $A_t = W = 0$ in (10), then $X_t$ samples the original Gibbs measure $\mu$ defined by (1) (see Equation (3) above).
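The interplay between the dynamics and the conditional expectation defining the biasing force can be sketched numerically. The Python example below is an illustration under strong simplifying assumptions: a two-dimensional configuration $(x, y)$, the reaction coordinate $\xi(x, y) = x$ with values in the torus (so that $|\nabla\xi| = 1$, $F = \partial_x V$, $W = 0$, and the modified and gradient dynamics coincide), and a conditional expectation replaced by an average over replicas falling in the same bin. The potential, the binning and the function name `abf_replicas` are hypothetical; this is not the discretization analyzed in this paper.

```python
import numpy as np

def abf_replicas(dV_dx, dV_dy, beta, dt, n_steps, n_replicas, n_bins, rng):
    """Replica-based sketch of ABF for xi(x, y) = x on the torus, with W = 0."""
    x = rng.random(n_replicas)              # reaction coordinate values in [0, 1)
    y = rng.standard_normal(n_replicas)
    bias = np.zeros(n_bins)                 # piecewise-constant estimate of the biasing force
    noise = np.sqrt(2.0 * dt / beta)
    for _ in range(n_steps):
        force = dV_dx(x, y)                 # local mean force F (equal to dV/dx since |grad xi| = 1)
        idx = np.minimum((x * n_bins).astype(int), n_bins - 1)
        sums = np.bincount(idx, weights=force, minlength=n_bins)
        counts = np.bincount(idx, minlength=n_bins)
        filled = counts > 0
        bias[filled] = sums[filled] / counts[filled]   # empirical E(F(X_t) | xi(X_t) = z)
        drift_y = -dV_dy(x, y)              # computed before x is updated
        x = (x + (-force + bias[idx]) * dt + noise * rng.standard_normal(n_replicas)) % 1.0
        y = y + drift_y * dt + noise * rng.standard_normal(n_replicas)
    return x, y, bias

# Hypothetical periodic potential: V = 2 cos(2 pi x) + (1 + 0.5 sin(2 pi x)) y^2 / 2.
dV_dx = lambda x, y: -4.0 * np.pi * np.sin(2 * np.pi * x) + 0.5 * np.pi * np.cos(2 * np.pi * x) * y**2
dV_dy = lambda x, y: (1.0 + 0.5 * np.sin(2 * np.pi * x)) * y

rng = np.random.default_rng(0)
x, y, bias = abf_replicas(dV_dx, dV_dy, beta=1.0, dt=1e-3, n_steps=1000,
                          n_replicas=2000, n_bins=20, rng=rng)
```

In this sketch the drift on the reaction coordinate is `-force + bias[idx]`, whose conditional average in each bin vanishes by construction; this is the mechanism by which the biasing force flattens the free energy landscape along $\xi$.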

Remark 1 (On the computation of $A_t'(z)$) From a practical point of view, with the additional terms mentioned in item 3 above, it is possible to compute the biasing force $A_t'(z)$ without explicitly evaluating $F$ since (by Itô's calculus on $X_t$ that satisfies (10), and assuming $W = 0$ for simplicity)

$$F(X_t)\,dt = -d\xi(X_t) + A_t'(\xi(X_t))\,dt + \sqrt{2\beta^{-1}}\,\frac{\nabla\xi}{|\nabla\xi|}(X_t)\cdot dB_t. \tag{13}$$

By a simple finite difference scheme, (13) thus yields an approximation of the biasing force.



1.3 A PDE formulation and presentation of the main result 



We would like to emphasize that our arguments are partially formal: we assume that we are given a process $X_t$ and a function $A_t'$ which satisfy (10)-(11), and such that $X_t$ has a smooth density $\psi(t,\cdot)$ with respect to the Lebesgue measure on $\mathcal{D}$. We suppose that this density is sufficiently regular so that the computations are valid. In particular, we assume that the potential $V$ is such that either the stochastic process $X_t$ lives in $\mathcal{D}$ and thus that its density $\psi(t,\cdot)$ decays sufficiently fast on $\partial\mathcal{D}$ or the stochastic process $X_t$ has some reflecting behavior on $\partial\mathcal{D}$ and thus that its density $\psi(t,\cdot)$ has zero normal derivatives on $\partial\mathcal{D}$. In both cases, no boundary terms appear
in the integrations by parts we perform to derive the entropy estimates. We refer 
for example to |i3j for an appropriate functional framework in which such entropy 
estimates hold. 

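Since only the law of the process $X_t$ at a fixed time $t$ is used in (11), it is possible to recast the dynamics (10)-(11) in the following nonlinear partial differential equation (PDE) on the density $\psi(t,\cdot)$ of $X_t$:

$$\begin{cases} \partial_t \psi = \mathrm{div}\left(|\nabla\xi|^{-2}\left(\nabla(V - A_t\circ\xi + W\circ\xi)\,\psi + \beta^{-1}\nabla\psi\right)\right), \\[4pt] A_t'(z) = \dfrac{\int_{\Sigma_z} F\,|\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}{\int_{\Sigma_z} |\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}, \end{cases} \tag{14}$$

where $F$ is defined by (6). This is obtained by using the fact that if $X_t$ has law $\psi(t,x)\,dx$, then the law of $\xi(X_t)$ is $\psi^\xi(t,z)\,dz$ with

$$\psi^\xi(t,z) = \int_{\Sigma_z} |\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}, \tag{15}$$

and the conditional law of $X_t$ with respect to $\xi(X_t) = z$ is $\mu_{t,z}$ defined by

$$d\mu_{t,z} = \frac{\psi(t,\cdot)\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}}{\psi^\xi(t,z)}. \tag{16}$$

The probability measure $\psi^\xi(t,z)\,dz$ is the image of the probability measure $\psi(t,x)\,dx$ by $\xi$. These expressions can be obtained using the co-area formula (see Appendix A).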

Before presenting the results, we would like to motivate the introduction of this dynamics by the following formal observation. If the potential $A_t$ and the law of $X_t$ reach a stationary state, then, from the dynamics (10) on $X_t$ (or from the partial differential equation (14) satisfied by the distribution of $X_t$), we observe that this stationary law is proportional to $\exp(-\beta(V(x) - A_\infty\circ\xi(x) + W\circ\xi(x)))\,dx$, where $A_\infty$ denotes the stationary state for $A_t$ (this requires a uniqueness result for the law of $X_t$, which holds for example if $|\nabla\xi|$ is uniformly bounded from below by a positive constant). Then, from the definition (11) of the biasing force, we obtain that, necessarily, $A_\infty' = A'$ (where $A'$ is the mean force defined by (5)). This proves the uniqueness of the stationary state for this dynamics. We can thus expect that $A_t'$ converges to the mean force $A'$ in the longtime limit.

The interest of the dynamics (10)-(11) is actually twofold. First, as expected from the formal argument above, in the longtime limit, $A_t'$ converges to the mean force $A'$ defined by (5) (see Equation (24) below). Second, using the ABF method, the law of $\xi(X_t)$ has a simple diffusive behavior (see Equation (20) below). The metastable feature of the simple dynamics (2) along $\xi$ is thus corrected by the addition of the adaptive potential $A_t$. The aim of this paper is to give a precise statement for these two assertions, which are mathematical formalizations of the two main characteristics [S1] and [S2] of adaptive techniques mentioned in Section 1.1. The proof of the longtime convergence relies on entropy techniques, and requires appropriate assumptions on the potentials $V$, $W$ and the reaction coordinate $\xi$. We prove that under suitable assumptions, the convergence of $A_t'$ to $A'$ is exponentially fast, with a rate of convergence limited, at the macroscopic level, by the rate of convergence of the law of $\xi(X_t)$ to its longtime limit, and, at the microscopic level, by the rate of convergence to the equilibrium conditioned probability measures $\mu_{\Sigma_z}$, for all values $z$ of the reaction coordinate.

All these results are more precisely stated in Section 2 and the proofs are given in Section 3. We would like to mention that the main arguments of the proof are given in a very simple case in Section 3.1 and that we also present a result of convergence for the dynamics (12)-(11) in Section 2.3.



2 Precise statements of the results 

In Section 2.1, we recall some well-known results on entropy and introduce the main notation used in the following to state the convergence result. Section 2.2 is devoted to the presentation of the convergence result for the dynamics (10)-(11). Finally, we give in Section 2.3 a (weaker) convergence result for the dynamics (12)-(11).



2.1 Entropy and Fisher information 

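Let us consider $\psi$ and $A_t'$ which satisfy (14), and let us introduce the long-time limits of $\psi$, $\psi^\xi$ (defined by (15)) and $\mu_{t,z}$ (defined by (16)):

$$\psi_\infty = Z_W^{-1}\exp(-\beta(V - A\circ\xi + W\circ\xi)), \qquad \psi^\xi_\infty = Z_W^{-1}\exp(-\beta W), \qquad d\mu_{\infty,z} = Z_{\Sigma_z}^{-1}\exp(-\beta V)\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z},$$

where

$$Z_W = \int_{\mathcal{M}} \exp(-\beta W(z))\,dz.$$

We recall that

$$Z_{\Sigma_z} = \int_{\Sigma_z} |\nabla\xi|^{-1}\exp(-\beta V)\,d\sigma_{\Sigma_z}, \qquad Z = \int_{\mathcal{D}} \exp(-\beta V(x))\,dx.$$

Notice that, with $A$ defined by (4), $\int_{\mathcal{D}} \psi_\infty = 1$, and that the probability measure $\psi^\xi_\infty(z)\,dz$ is the image of the probability measure $\psi_\infty(x)\,dx$ by $\xi$.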

In order to state the results, we also need to introduce the following projection operators. For any $x \in \mathcal{D}$, we denote by

$$P(x) = \mathrm{Id} - \frac{\nabla\xi \otimes \nabla\xi}{|\nabla\xi|^2}(x)$$

the orthogonal projection operator onto the tangent space $T_x\Sigma_{\xi(x)}$ to $\Sigma_{\xi(x)}$ at point $x$, and by

$$Q(x) = \mathrm{Id} - P(x)$$

the orthogonal projection operator onto the normal space $N_x\Sigma_{\xi(x)}$ to $\Sigma_{\xi(x)}$ at point $x$. We denote by $\otimes$ the tensor product: for two vectors $u, v \in \mathbb{R}^n$, $u \otimes v$ is an $n \times n$ matrix with components $(u \otimes v)_{i,j} = u_i v_j$.

We measure the "distance" between $\psi$ (respectively $\psi^\xi$) and $\psi_\infty$ (respectively $\psi^\xi_\infty$) using the relative entropy $H(\psi|\psi_\infty)$ (respectively $H(\psi^\xi|\psi^\xi_\infty)$), where, for any two probability measures $\mu$ and $\nu$ such that $\mu$ is absolutely continuous with respect to $\nu$ (this property being denoted $\mu \ll \nu$ in the following),

$$H(\mu|\nu) = \int \ln\left(\frac{d\mu}{d\nu}\right) d\mu.$$

We recall the Csiszár-Kullback inequality:

$$\|\mu - \nu\|_{TV} \le \sqrt{2 H(\mu|\nu)}, \tag{17}$$

where $\|\mu - \nu\|_{TV} = \sup_{f,\ \|f\|_{L^\infty} \le 1} \left\{\int f\,d(\mu - \nu)\right\}$ denotes the total variation norm of the signed measure $\mu - \nu$. When $\mu$ and $\nu$ both have densities with respect to the Lebesgue measure, $\|\mu - \nu\|_{TV}$ is simply the $L^1$ norm of the difference between the two densities.
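The relative entropy and the Csiszár-Kullback inequality are easy to check numerically on a finite state space, where densities become probability vectors and the total variation norm, with the convention used above, is the $\ell^1$ norm of the difference. The following Python sketch is only an illustration; the helper names and the random test vectors are assumptions of this example.

```python
import numpy as np

def relative_entropy(p, q):
    """H(p | q) = sum_i p_i ln(p_i / q_i), with the convention 0 ln 0 = 0."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def total_variation(p, q):
    """sup over |f| <= 1 of sum_i f_i (p_i - q_i), i.e. the l^1 norm of p - q."""
    return float(np.sum(np.abs(p - q)))

rng = np.random.default_rng(1)
p = rng.random(50) + 0.1      # strictly positive weights
p /= p.sum()
q = rng.random(50) + 0.1
q /= q.sum()

H = relative_entropy(p, q)
tv = total_variation(p, q)
bound = np.sqrt(2.0 * H)      # right-hand side of the Csiszar-Kullback inequality
```

Here `tv <= bound` holds for any pair of probability vectors with `q` strictly positive, which is the discrete counterpart of the inequality used to turn entropy decay into $L^1$ convergence.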

We denote the total entropy by

$$E(t) = H(\psi(t,\cdot)|\psi_\infty),$$

the macroscopic entropy by

$$E_M(t) = H(\psi^\xi(t,\cdot)|\psi^\xi_\infty),$$

the "local entropy" at a fixed value $z$ of the reaction coordinate by

$$e_m(t,z) = H(\mu_{t,z}|\mu_{\infty,z}),$$

and the microscopic entropy by

$$E_m(t) = \int_{\mathcal{M}} e_m(t,z)\,\psi^\xi(t,z)\,dz.$$

It is straightforward to obtain the following result which can be seen as the extensivity of the entropy:

Lemma 1 It holds

$$E(t) = E_M(t) + E_m(t).$$
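The extensivity property of Lemma 1 has an exact discrete analogue, which may help to see why it holds: the logarithm of a ratio of joint laws splits into a marginal part and a conditional part. In the Python sketch below, rows of the arrays play the role of the values $z$ of the reaction coordinate and columns the role of points on $\Sigma_z$; the array sizes and names are hypothetical choices for this illustration.

```python
import numpy as np

def entropy(p, q):
    """Relative entropy H(p | q) = sum p ln(p / q) for strictly positive arrays."""
    return float(np.sum(p * np.log(p / q)))

rng = np.random.default_rng(2)
psi = rng.random((6, 8)) + 0.1        # joint law: rows index z, columns index points of Sigma_z
psi /= psi.sum()
psi_inf = rng.random((6, 8)) + 0.1    # reference (long-time) joint law
psi_inf /= psi_inf.sum()

marg, marg_inf = psi.sum(axis=1), psi_inf.sum(axis=1)   # marginal laws in z
cond = psi / marg[:, None]                              # conditional laws given z
cond_inf = psi_inf / marg_inf[:, None]

E_total = entropy(psi, psi_inf)                         # total entropy
E_macro = entropy(marg, marg_inf)                       # macroscopic entropy
local = np.array([entropy(cond[i], cond_inf[i]) for i in range(psi.shape[0])])
E_micro = float(np.sum(local * marg))                   # local entropies averaged against the marginal
```

Up to rounding errors, `E_total` equals `E_macro + E_micro`, mirroring the decomposition of the total entropy into macroscopic and microscopic parts.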

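Let us now introduce the Fisher information: for any two probability measures $\mu$ and $\nu$ such that $\mu \ll \nu$,

$$I(\mu|\nu) = \int \left|\nabla \ln\left(\frac{d\mu}{d\nu}\right)\right|^2 d\mu. \tag{18}$$

In the case where $\mu$ and $\nu$ are probability measures on the (Riemannian) submanifold $\Sigma_z$, $\nabla$ actually denotes the gradient on $\Sigma_z$ in (18), namely

$$\nabla_{\Sigma_z} = P\nabla. \tag{19}$$

Therefore, for the conditional probability measures $\mu_{t,z}$ and $\mu_{\infty,z}$, the Fisher information writes

$$I(\mu_{t,z}|\mu_{\infty,z}) = \int_{\Sigma_z} \left|\nabla_{\Sigma_z} \ln\left(\frac{\psi(t,\cdot)}{\psi_\infty}\right)\right|^2 \frac{\psi(t,\cdot)\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}}{\psi^\xi(t,z)}.$$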

Let us finally introduce another way to compare two probability measures, namely the Wasserstein distance with quadratic cost: for two probability measures $\mu$ and $\nu$ defined on a Riemannian manifold $\Sigma$,

$$W(\mu,\nu) = \sqrt{\inf_{\pi \in \Pi(\mu,\nu)} \int_{\Sigma\times\Sigma} d_\Sigma(x,y)^2\,d\pi(x,y)}.$$

In this expression, $d_\Sigma$ denotes the geodesic distance on $\Sigma$: $\forall x, y \in \Sigma$,

$$d_\Sigma(x,y) = \inf\left\{\sqrt{\int_0^1 |\dot{w}(t)|^2\,dt}\ \middle|\ w \in C^1([0,1],\Sigma),\ w(0) = x,\ w(1) = y\right\},$$

and $\Pi(\mu,\nu)$ denotes the set of coupling probability measures, namely probability measures on $\Sigma \times \Sigma$ such that their marginals are $\mu$ and $\nu$. We need the following definitions:

Definition 1 The probability measure $\nu$ is said to satisfy a logarithmic Sobolev inequality with constant $\rho > 0$ (in short: LSI($\rho$)) if for all probability measures $\mu$ such that $\mu \ll \nu$,

$$H(\mu|\nu) \le \frac{1}{2\rho}\,I(\mu|\nu).$$

Definition 2 The probability measure $\nu$ is said to satisfy a Talagrand inequality with constant $\rho > 0$ (in short: T($\rho$)) if for all probability measures $\mu$ such that $\mu \ll \nu$,

$$W(\mu,\nu) \le \sqrt{\frac{2}{\rho}\,H(\mu|\nu)}.$$

In the latter definition, we implicitly assume that the probability measures have finite 
moments of order 2. This will always be the case for all the probability measures we 
consider. We will need the following important result (see pjj, Theorem 1]). 

Lemma 2 If $\nu$ satisfies LSI($\rho$), then $\nu$ satisfies T($\rho$).

For an introduction to logarithmic Sobolev inequalities, their properties and their 
relation to longtime behavior of solutions to PDEs, we refer to [2} ISj I16j. 



2.2 Convergence of the adaptive dynamics (10)-(11)

We are now in position to state our main results. Concerning the dynamics on the law of $\xi(X_t)$, we have:

Proposition 1 (Equation satisfied by the marginal density $\psi^\xi$) Let $(\psi, A_t')$ be a smooth solution to (14) and let us assume [H1]. Then $\psi^\xi$ satisfies the following equation:

$$\partial_t \psi^\xi = \partial_z\left(W'\,\psi^\xi + \beta^{-1}\,\partial_z \psi^\xi\right) \quad \text{on } \mathcal{M}. \tag{20}$$

Remark 2 Notice that even if $\psi^\xi$ satisfies a closed PDE, $\xi(X_t)$ does not satisfy a closed SDE (see Equation (13) above).

The fundamental assumptions we need to prove longtime convergence are the following (we recall that the local mean force $F$ is defined by (6)):

[H2] $V$ and $\xi$ are sufficiently differentiable functions such that $\|\nabla\xi\|_{L^\infty} \le m < \infty$ and $\|\nabla_{\Sigma_z} F\|_{L^\infty} \le M < \infty$,

[H3] $V$ and $\xi$ are such that $\exists \rho > 0$, for all $z \in \mathcal{M}$, the conditional measure $\mu_{\infty,z}$ satisfies LSI($\rho$).

In Assumption [H2], the requirement on $F$ can be seen as a boundedness condition on the coupling between the conditional measures $\mu_{\infty,z}$ and the corresponding marginal $\psi^\xi_\infty$, since it involves the mixed derivatives (along the tangential space and the normal space of the submanifold $\Sigma_z$) $P\nabla(Q\nabla V)$ (see [14] and Remark 11 below).

Assumption [H3] ensures that if, for a fixed value $z$ of the reaction coordinate, the conditioned probability measure $\mu_{\infty,z}$ were to be sampled by a simple constrained gradient dynamics (see [6]), the convergence to equilibrium would be exponential with rate $\rho$. We refer to $\rho$ as the microscopic rate of convergence in the sequel.

We refer to Section [STTl for an explicit framework where [H2] and [H3] are satisfied, 
and to Remark [3] below for alternative assumptions on V and ^. 

Let us now introduce the assumption we need on $W$.

[H4] $W$ is such that $\exists I_0 > 0$, $r > 0$, $\forall t \ge 0$, $I(\psi^\xi(t,\cdot)|\psi^\xi_\infty) \le I_0 \exp(-2\beta^{-1} r t)$.

Assumption [H4] is indeed an assumption on $W$ because $\psi^\xi$ satisfies the PDE (20) where only $W$ appears. Assumption [H4] ensures that the law of $\xi(X_t)$ converges to equilibrium exponentially fast with rate $r$, which we refer to as the macroscopic rate of convergence in the sequel.
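As an illustration of Assumption [H4] in the simplest setting, consider the torus with $W = 0$, where the marginal density solves the heat equation $\partial_t \psi^\xi = \beta^{-1}\partial_{zz}\psi^\xi$ and the longtime limit is the constant density $1$. The Python sketch below assumes an initial density made of the constant plus a single Fourier mode (a hypothetical choice, for which the solution is explicit) and compares the Fisher information with the bound $I_0 \exp(-2\beta^{-1} r t)$ for $r = 4\pi^2$.

```python
import numpy as np

def fisher_information_torus(psi, dpsi):
    """I(psi | 1) = int_0^1 |d_z ln psi|^2 psi dz, by the periodic rectangle rule."""
    return float(np.mean(dpsi**2 / psi))

beta, a = 1.0, 0.5                       # inverse temperature and initial mode amplitude (assumed)
z = np.arange(2000) / 2000.0             # uniform grid on the torus [0, 1)
rate = 4.0 * np.pi**2 / beta             # decay rate of the first Fourier mode

def fisher_at(t):
    # psi(t, z) = 1 + a exp(-rate t) cos(2 pi z) solves the heat equation on the torus.
    b = a * np.exp(-rate * t)
    psi = 1.0 + b * np.cos(2.0 * np.pi * z)
    dpsi = -2.0 * np.pi * b * np.sin(2.0 * np.pi * z)
    return fisher_information_torus(psi, dpsi)

times = np.linspace(0.0, 0.1, 11)
fisher = np.array([fisher_at(t) for t in times])
bound = fisher[0] * np.exp(-2.0 * rate * times)   # I_0 exp(-2 beta^{-1} r t) with r = 4 pi^2
```

For this explicit solution, `fisher` stays below `bound` at every time of the grid, with equality at `t = 0`, consistent with an exponential decay of the Fisher information at the macroscopic rate.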

We will see below (see [H4']) some sufficient explicit conditions on $W$ for [H4] to be satisfied.

Theorem 1 (Exponential convergence of the entropy to zero) Let us assume [H1], [H2], [H3] and [H4]. Then the microscopic entropy $E_m$ satisfies:

$$\sqrt{E_m(t)} \le C \exp(-\lambda t), \tag{21}$$

where $C = 2\max\left(\sqrt{E_m(0)},\ M\sqrt{\frac{I_0}{2\rho}}\,\frac{1}{\beta^{-1}|\rho m^{-2} - r|}\right)$ and

$$\lambda = \beta^{-1}\min(\rho m^{-2}, r). \tag{22}$$

In the special case $\rho m^{-2} = r$, $E_m$ satisfies $\sqrt{E_m(t)} \le \left(\sqrt{E_m(0)} + M\sqrt{\frac{I_0}{2\rho}}\,t\right)\exp(-\beta^{-1} r t)$.

This implies that the total entropy $E$ and thus $\|\psi(t,\cdot) - \psi_\infty\|_{L^1(\mathcal{D})}$ both converge exponentially fast to zero with rate $\lambda$.

We thus obtain that the biasing force $A_t'$ converges to the mean force $A'$ in the following sense: $\forall t \ge 0$,

$$\int_{\mathcal{M}} |A_t' - A'|^2(z)\,\psi^\xi(t,z)\,dz \le \frac{2M^2}{\rho}\,E_m(t). \tag{23}$$

Notice that the fact that $E$ and $\|\psi(t,\cdot) - \psi_\infty\|_{L^1(\mathcal{D})}$ converge exponentially fast to zero with rate $\lambda$ is an immediate consequence of (21), [H4], Lemma 1 and the Csiszár-Kullback inequality (17).

We will actually consider the following two cases for which [H4] is satisfied:

[H4'] If $\mathcal{M} = \mathbb{T}$, then $W = 0$. If $\mathcal{M} = \mathbb{R}$, then $W$ is a potential such that $W''$ is bounded from below and there exists $\bar{r} > 0$ such that $\psi^\xi_\infty = \frac{\exp(-\beta W)}{\int_{\mathbb{R}} \exp(-\beta W)}$ satisfies LSI($\bar{r}$).

Notice that in the case $\mathcal{M} = \mathbb{R}$, the assumptions stated in [H4'] on $W$ are satisfied for an $\alpha$-convex potential (namely if $W'' \ge \alpha$ for a positive $\alpha$), and then it is possible to choose $r = \alpha$ in [H4] (see Lemma 13 below). We refer to Remark 4 below for alternative assumptions on $W$.

Corollary 1 (Convergence of the biasing force) If [H4'] is satisfied and $\psi^\xi$ satisfies (20), then [H4] holds.

More precisely, if $\mathcal{M} = \mathbb{T}$ and $W = 0$, then [H4] is satisfied with $I_0 = I(\psi^\xi(0,\cdot)|\psi^\xi_\infty)$ and $r = 4\pi^2$. If $\mathcal{M} = \mathbb{R}$, $W''$ is bounded from below and $\frac{\exp(-\beta W)}{\int_{\mathbb{R}}\exp(-\beta W)}$ satisfies LSI($\bar{r}$), then [H4] is satisfied with $r = \bar{r} - \varepsilon$ for any $\varepsilon \in (0, \bar{r})$.

Let us now assume [H1], [H2], [H3] and [H4']. From (23), we deduce that for all compact $K \subset \mathcal{M}$, $\exists C, t^* > 0$, $\forall t \ge t^*$,

$$\int_K |A_t' - A'|(z)\,\psi^\xi_\infty(z)\,dz \le C\exp(-\lambda t), \tag{24}$$

where $\lambda$ is the rate of convergence defined by (22) in Theorem 1.

These results therefore show that $A_t'$ converges exponentially fast to $A'$ (in $L^1(\psi^\xi_\infty(z)\,dz)$-norm) at a rate $\lambda = \beta^{-1}\min(\rho m^{-2}, r)$. The limitations on the rate $\lambda$ are related to the rate of convergence $r$ at the macroscopic level, for the equation (20) satisfied by $\psi^\xi$, and the rate of convergence at the microscopic level, which depends on the constant $\rho$ of the logarithmic Sobolev inequalities satisfied by the conditional measures $\mu_{\infty,z}$. This constant of course depends on the choice of the reaction coordinate. In our framework, we could state that a "good reaction coordinate" is such that $\rho$ is as large as possible.

The proof of these results is given in Sections [3.11 13.21 and [3^ below. 

Remark 3 (Other possible assumptions on $V$ and $\xi$) We would like to mention other possible assumptions on $V$ and $\xi$ than [H2]-[H3] for which the results of Theorem 1 still hold.

First, in [H2], it is possible to change the assumption $\|\nabla_{\Sigma_z} F\|_{L^\infty} \le M < \infty$ to

$$\|F\|_{L^\infty} \le M < \infty.$$

Indeed, this simply changes the estimate (35) in Lemma\J^ below to the following

$$|A_t'(z) - A'(z)| \le \|F\|_{L^\infty}\,\|\mu_{t,z} - \mu_{\infty,z}\|_{TV} \le M\sqrt{2H(\mu_{t,z}|\mu_{\infty,z})},$$

by the Csiszár-Kullback inequality (17). The rest of the proof remains exactly the same.

Second, it is possible to obtain a similar result of convergence under slightly different assumptions than [H2]-[H3] by introducing another Riemannian structure on the submanifolds $\Sigma_z$. This is made precise in Appendix B (see assumptions [H2']-[H3']).

Remark 4 (Other possible assumptions on $W$) From Lemmas 12 and 13 below (used to prove Corollary 1), it will become clear that [H4] is actually satisfied with $W = 0$ as soon as $\mathcal{M}$ is a bounded domain. If $\mathcal{M}$ is an unbounded domain, then a potential $W$ with properties such as those stated in [H4'] is needed. We discuss in this remark other properties of $W$ ensuring [H4] than those proposed in [H4'], in the case $\mathcal{M} = \mathbb{R}$ (or $\mathcal{M}$ is an unbounded domain).

In this case, it is actually also possible to satisfy [H4] by choosing $W$ such that the dynamics is confined in a domain $\bigcup_{z\in\mathcal{N}} \Sigma_z$ where $\mathcal{N}$ is a bounded subset of $\mathcal{M}$. This can be done by using a sufficiently confining potential $W$ and adapting Lemma[W\ below, or by adding reflection terms to restrict $\xi$ to $\mathcal{N}$ (which loosely speaking corresponds to taking $W$ zero on $\mathcal{N}$ and infinite on $\mathcal{M}\setminus\mathcal{N}$) and adapting Lemma{TM below.

Let us make this latter case precise. Suppose for example we are interested in the values of $A'(z)$ for $z \in \mathcal{N} = (0,1)$. The dynamics is confined in the domain $\Omega = \bigcup_{0<z<1} \Sigma_z$. The ABF dynamics is

$$\begin{cases} \partial_t \psi = \mathrm{div}\left(|\nabla\xi|^{-2}\left(\nabla(V - A_t\circ\xi)\,\psi + \beta^{-1}\nabla\psi\right)\right), & \text{on } \Omega, \\[4pt] \left(\nabla(V - A_t\circ\xi)\,\psi + \beta^{-1}\nabla\psi\right)\cdot\nabla\xi = 0, & \text{on } \Sigma_0 \cup \Sigma_1, \\[4pt] A_t'(z) = \dfrac{\int_{\Sigma_z} F\,|\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}{\int_{\Sigma_z} |\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}, & \text{for } z \in (0,1), \end{cases}$$

where $F$ is defined by (6). From the point of view of the stochastic process $X_t$, the boundary condition translates to a normal reflection on the two submanifolds $\Sigma_0$ and $\Sigma_1$. Moreover, it can be checked (using Lemma\^) that the boundary condition on $\psi$ translates to a zero Neumann boundary condition on $\psi^\xi$: $\partial_z\psi^\xi(0) = \partial_z\psi^\xi(1) = 0$. A proof similar to that of Lemma[IM then shows that $I(\psi^\xi|\psi^\xi_\infty)$ converges exponentially fast to 0, so that [H4] holds. The arguments we use to prove Theorem 1 and Corollary 1 then show that $\|A_t' - A'\|_{L^2(0,1)}$ goes to zero exponentially fast.

Remark 5 (Vectorial reaction coordinate) In this work, we assume that the reaction coordinate $\xi$ has values in $\mathbb{T}$ or $\mathbb{R}$. The dynamics (10)-(11) and the results of convergence presented in this section can be straightforwardly extended to the case when $\xi = (\xi_1, \ldots, \xi_m)$ has values in $\mathbb{T}^m$ or $\mathbb{R}^m$, with $2 \le m < n$, under the orthogonality condition:

$$\forall i \ne j, \quad \nabla\xi_i \cdot \nabla\xi_j = 0. \tag{25}$$

The generalization of this dynamics to non-orthogonal reaction coordinates is unclear. In this case, it is possible to resort to metadynamics (see Remark 6 below). Alternatively, the dynamics (12)-(11) (and the result of convergence of Section 2.3 for this dynamics) can straightforwardly be generalized to a vectorial reaction coordinate.

Remark 6 (Metadynamics) The adaptive biasing force technique can also be used in the context of metadynamics \1(A \^. The principle of metadynamics is to introduce an additional variable $z$ with dimension the dimension of $\xi$ (say $z \in \mathbb{R}^m$, with $1 \le m < n$), and an extended potential $V_\zeta(q,z) = V(q) + \frac{\zeta}{2}|z - \xi(q)|^2$. The reaction coordinate is then chosen to be $\xi_{\mathrm{meta}}(q,z) = z$ so that the associated free energy is

$$A_\zeta(z) = -\beta^{-1}\ln\int_{\mathcal{D}} \exp(-\beta V_\zeta(q,z))\,dq,$$

which converges to $A(z)$ when $\zeta$ goes to infinity. In our framework, the ABF method applied to this extended system writes:

$$\begin{cases} dX_t = \left(-\nabla V(X_t) + \zeta\,(Z_t - \xi(X_t))\,\nabla\xi(X_t)\right)dt + \sqrt{2\beta^{-1}}\,dB_t, \\[4pt] dZ_t = \zeta\left(\xi(X_t) - \mathbb{E}(\xi(X_t) \mid Z_t)\right)dt + \sqrt{2\beta^{-1}}\,d\bar{B}_t, \end{cases}$$

where $\bar{B}_t$ is an $m$-dimensional Brownian motion, independent of $B_t$. Notice that by construction, the orthogonality condition (25) is satisfied by $\xi_{\mathrm{meta}}$, so that the convergence results of this section apply to these kinds of models.

Remark 7 (On the initial condition) If $\psi^\xi(0,\cdot)$ is zero at some points or is not sufficiently smooth, then $A_0'$ may not be well defined or $I(\psi^\xi(0,\cdot)|\psi^\xi_\infty)$ may be infinite (which is in contradiction with [H4]). But since we show that $\psi^\xi$ satisfies a simple diffusion equation (see Proposition 1), these difficulties disappear as soon as $t > 0$. Therefore, up to considering the problem for $t \ge t_0 > 0$, we can suppose that $\psi^\xi(0,\cdot) > 0$.

Remark 8 (On the choice of the entropy) In the case of linear Fokker-Planck equations, it is well known that one can obtain exponential convergence to equilibrium by considering various entropies of the form $\int h\left(\frac{d\mu}{d\nu}\right)d\nu$, where $h$ is typically a strictly convex function such that $h(1) = 0$ (see 1^1 for more assumptions required on $h$). For example, the classical choice $h(x) = \frac{1}{2}(x-1)^2$ is linked to Poincaré type inequalities and leads to $L^2$-convergence, while the function $h(x) = x\ln x - x + 1$ we have used here to build the entropy is linked to logarithmic Sobolev inequalities and leads to $L^1\ln L^1$-convergence. However, for the study of the non-linear Fokker-Planck equation (14), it seems that the choice $h(x) = x\ln x - x + 1$ is necessary to derive the estimates, for example to have the extensivity property of Lemma 1.

Remark 9 (Smoother evolution in time of $A_t'$) In practice, it may be useful to update the adaptive potential $A_t'$ in a smoother way in time, for example by replacing (11) by

$$dA_t'(z) = \frac{1}{\tau}\left(\mathbb{E}\left(F(X_t) \mid \xi(X_t) = z\right) - A_t'(z)\right)dt,$$

where $F$ is defined by (6) and $\tau > 0$ denotes a characteristic time (possibly depending on $(t,z)$), to be fixed. This amounts to replacing $A_t'$ by $K_\tau * A_t'$ in (10), where $K_\tau$ is an exponential convolution kernel. Formally, we here consider the limit case $\tau = 0$. To prove the convergence of $A_t'$ towards $A'$ for $\tau \ne 0$ is an open problem.

Remark 10 (Enhancing the macroscopic rate of convergence) Let us consider the case $\mathcal{M} = \mathbb{R}$. For an $\alpha$-convex potential $W$, Corollary 1 states that $A'_t$ converges towards $A'$ exponentially fast, with a rate $\lambda = \beta^{-1}\min(\rho m^{-2}, \alpha)$. This may seem surprising since for large enough $\alpha$, the rate of convergence is no longer limited by $\alpha$. However, it is typically expected that the constant $I_0$ in assumption [H4] increases with growing $\alpha$, which means that the constant $C$ increases in the convergence estimate (21). Moreover, in practice, if $\alpha$ is very large, $\psi^\xi_\infty$ is very peaked and some parts of $\mathcal{M}$ are poorly sampled, so that the variance of the result is large in these areas (which cannot be seen in our convergence result). Actually, a good method to enhance the rate of convergence at the macroscopic level while keeping a good sampling and thus low variance, is to use a particle system with many replicas and a selection mechanism. We refer to [13] for more details.

2.3 A convergence result for the adaptive dynamics (12)-(11)



In this section, we present a weaker convergence result for another adaptive overdamped Langevin dynamics, namely (12)-(11). For simplicity, we only consider the case

$\mathcal{M} = \mathbb{T}$ and $W = 0$,

but the results can be extended to the case $\mathcal{M} = \mathbb{R}$ with a suitable $W \neq 0$, as in Section 2.2 (see [H4] and [H4']). One interest of this dynamics and this result of convergence is that they can be straightforwardly extended to the case of a multi-dimensional reaction coordinate (see Remark 5 above). For the sake of conciseness,
we do not provide the details of the result in this case which follows exactly the same 
lines (see ^ and Appendix [A] for formulas in the case of a multi-dimensional reaction 
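coordinate). Let us recall the dynamics (12)-(11) we consider here:

\[ dX_t = -\nabla\big(V - A_t\circ\xi\big)(X_t)\,dt + \sqrt{2\beta^{-1}}\,dB_t, \tag{26} \]

with the same definition as before for $A_t$: $\forall z \in \mathbb{T}$,

\[ A'_t(z) = \mathbb{E}\big(F(X_t)\,\big|\,\xi(X_t) = z\big), \tag{27} \]

where $F$ is defined by (6). The associated non-linear Fokker Planck equation is now:

\[ \begin{cases} \partial_t\psi = \operatorname{div}\big(\nabla(V - A_t\circ\xi)\,\psi + \beta^{-1}\nabla\psi\big), \\[4pt] A'_t(z) = \dfrac{\int_{\Sigma_z} F\,|\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}{\int_{\Sigma_z} |\nabla\xi|^{-1}\,\psi(t,\cdot)\,d\sigma_{\Sigma_z}}. \end{cases} \tag{28} \]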



The main difference with the dynamics (10)-(11) considered in Theorem 1 is that the marginal distribution $\psi^\xi$ does not satisfy a closed partial differential equation. Therefore, we do not know a priori that the Fisher information $I(\psi^\xi|\psi^\xi_\infty)$ converges to 0. The strategy here is to directly estimate the derivative of the total entropy $E$. We obtain a convergence result under two additional assumptions (see [H5]-[H6]).

Theorem 2 (Longtime convergence for the dynamics (26)-(27)) Let $(\psi, A'_t)$ be a smooth solution to (28) and let us assume [H1], [H2], [H3]. Moreover, we suppose

[H5] $V$ and $\xi$ are such that $\exists R > 0$, $\psi_\infty$ satisfies LSI($R$), (29)

and

[H6] $\dfrac{Mm\beta}{2\rho} < 1$.

Then the total entropy $E$ satisfies:

\[ E(t) \le E(0)\exp(-\lambda t), \]

where $\lambda = \beta^{-1}\Big(1 - \dfrac{Mm\beta}{2\rho}\Big)R$ is positive using [H6]. In particular, as in Theorem 1, the biasing force $A'_t$ converges exponentially fast to the mean force $A'$.

The proof of this result is given in Section 3.4 below.

Remark 11 (On assumption [H5]) In [14, Theorem 2], it is shown that if $\mu = \exp(-H(x_1,x_2))\,dx_1\,dx_2$ is a probability measure on a product space $X = X_1 \times X_2$ (where $X_i$ are Euclidean spaces), if the conditional probabilities $\mu(dx_2|x_1)$ satisfy LSI($\rho_2$) (with $\rho_2$ independent of $x_1$) and the marginal $\bar\mu(dx_1)$ satisfies LSI($\rho_1$), then $\mu$ satisfies LSI($\rho$) provided the coupling between the two directions is bounded: $\exists \kappa_{1,2} > 0$, $\|\partial_{x_1,x_2} H\|_{L^\infty} \le \kappa_{1,2}$.

Thus, in the simple framework of Section 3.1 for example, where the configuration space is $\mathbb{T} \times \mathbb{R}$ and the reaction coordinate is $\xi(x,y) = x$, the fact that $\psi_\infty$ satisfies a LSI (assumption [H5]) can be deduced from the fact that the conditioned distributions $\mu_{\infty,z}$ satisfy a LSI (which is [H3]), the marginal $\psi^\xi_\infty$ satisfies a LSI (which is related to [H4]) and the coupling is bounded (which is [H2]). Thus [H5] is not needed as an additional assumption compared to the framework of Theorem 1. The generalization of this result to the case when $X$ is not a product does not seem to be straightforward.



3 Proofs 

One remark to simplify the presentation of the proofs is that we can suppose $\beta = 1$ up to the following change of variable: $\tilde t = \beta^{-1}t$, $\tilde\psi(\tilde t, x) = \psi(t,x)$, $\tilde V(x) = \beta V(x)$ and $\tilde W(x) = \beta W(x)$. Therefore, we suppose in the following that

\[ \beta = 1. \tag{30} \]

3.1 Proof of Proposition 1 and Theorem 1 in a simple case

In this section, we propose to prove Proposition 1 and Theorem 1 in the simple case $n = 2$, $\xi(x,y) = x$ (so that we use in this section the notation $x$ instead of $z$ for the reaction coordinate variable) and the configuration space is $\mathcal{D} = \mathbb{T} \times \mathbb{R}$ (which means that all the data are periodic with respect to the first coordinate $x$). In this case, we thus have $\xi \in \mathbb{T}$ ($\mathcal{M} = \mathbb{T}$) so that we choose $W = 0$ (see [H4']). Notice also that the local mean force $F$ is simply given by $F = \partial_x V$ (see (6)). Our aim is to introduce the main arguments in this simple case before presenting the general proof in Section 3.2. In this simple setting, the system (14) writes (recall $\beta = 1$):

\[ \begin{cases} \partial_t\psi = \operatorname{div}\big(\nabla V\,\psi + \nabla\psi\big) - \partial_x\big(A'_t\,\psi\big), \\[4pt] A'_t(x) = \dfrac{\int_{\mathbb{R}} \partial_x V(x,y)\,\psi(t,x,y)\,dy}{\psi^\xi(t,x)}, \end{cases} \tag{31} \]

where $\psi^\xi(t,x) = \int_{\mathbb{R}} \psi(t,x,y)\,dy$. Notice that in this case $\psi^\xi_\infty = 1$.
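To make the system (31) more concrete, here is a minimal particle sketch of the corresponding adaptive dynamics. It is an illustration under stated assumptions and not the authors' implementation: the potential $V(x,y) = \cos(2\pi x) + \frac{1}{2}(y - a\sin(2\pi x))^2$, the Euler-Maruyama time discretization, the use of many independent replicas, and the binned (histogram) estimate of the conditional expectation $A'_t(x) = \mathbb{E}(\partial_x V(X_t,Y_t)\,|\,X_t = x)$ are all hypothetical choices, as are the function names and parameter values.

```python
import numpy as np

def grad_V(x, y, a=1.0):
    """Gradient of the assumed potential V(x, y) = cos(2*pi*x) + (y - a*sin(2*pi*x))**2 / 2."""
    s, c = np.sin(2 * np.pi * x), np.cos(2 * np.pi * x)
    r = y - a * s
    dVdx = -2 * np.pi * s - r * a * 2 * np.pi * c  # local mean force F = d_x V
    dVdy = r
    return dVdx, dVdy

def abf_step(x, y, dt, n_bins, rng, a=1.0):
    """One Euler-Maruyama step of the biased dynamics with beta = 1."""
    dVdx, dVdy = grad_V(x, y, a)
    # Bin index of each replica along the reaction coordinate x in [0, 1].
    bins = np.minimum((x * n_bins).astype(int), n_bins - 1)
    counts = np.bincount(bins, minlength=n_bins)
    sums = np.bincount(bins, weights=dVdx, minlength=n_bins)
    # Empirical conditional expectation of F given the bin (0 in empty bins).
    bias = np.where(counts > 0, sums / np.maximum(counts, 1), 0.0)
    noise = rng.standard_normal((2, x.size)) * np.sqrt(2 * dt)
    x_new = (x + (-dVdx + bias[bins]) * dt + noise[0]) % 1.0  # periodic in x
    y_new = y - dVdy * dt + noise[1]
    return x_new, y_new, bias

def run_abf(n_particles=2000, n_steps=200, dt=1e-3, n_bins=20, seed=0):
    """Run the replicas and return final positions and the last bias estimate."""
    rng = np.random.default_rng(seed)
    x = rng.random(n_particles)
    y = rng.standard_normal(n_particles)
    bias = np.zeros(n_bins)
    for _ in range(n_steps):
        x, y, bias = abf_step(x, y, dt, n_bins, rng)
    return x, y, bias
```

For this particular choice of $V$, the conditional law of $y$ given $x$ at equilibrium is Gaussian with mean $a\sin(2\pi x)$, so the mean force is $A'(x) = -2\pi\sin(2\pi x)$; the array returned as `bias` can be compared with this profile, up to statistical and binning errors.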

It can be checked that the assumptions [H2] and [H3] are satisfied in this context for a potential $V$ of the following form:

\[ V(x,y) = V_0(x,y) + V_1(x,y) \]

where $\inf_{\mathbb{T}\times\mathbb{R}} \partial_{y,y}V_0 > 0$, $\|V_1\|_{L^\infty} < \infty$, $\|\partial_{x,y}(V_0 + V_1)\|_{L^\infty} < \infty$. The potential $V$ is thus a bounded perturbation of an $\alpha$-convex potential, with a bounded mixed derivative $\partial_{x,y}V$. Then, assumptions [H2]-[H3] are satisfied with $m = 1$, $M = \|\partial_{x,y}V\|_{L^\infty}$ and $\rho = (\inf_{\mathbb{T}\times\mathbb{R}} \partial_{y,y}V_0)\exp(-\operatorname{osc} V_1)$, where $\operatorname{osc} V_1 = \sup_{\mathbb{T}\times\mathbb{R}} V_1 - \inf_{\mathbb{T}\times\mathbb{R}} V_1$ (see [2]).
Proposition 1 is simply obtained by integration of (31) with respect to $y \in \mathbb{R}$:

Lemma 3 The density $\psi^\xi$ satisfies the following equation on $\mathbb{T}$:

\[ \partial_t\psi^\xi = \partial_{x,x}\psi^\xi. \tag{32} \]

As stated in Corollary 1, this result already yields the exponential convergence to zero of the macroscopic Fisher information $I(\psi^\xi|\psi^\xi_\infty)$ (this is the matter of Lemma 12 below), and thus [H4] is indeed satisfied with $I_0 = I(\psi^\xi(0,\cdot)|\psi^\xi_\infty)$ and $r = 4\pi^2$.

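A fundamental lemma needed in the sequel is

Lemma 4 The difference between the biasing force $A'_t$ and the mean force $A'$ can be expressed in terms of the densities as

\[ A'_t - A' = \int_{\mathbb{R}} \partial_x\ln\Big(\frac{\psi}{\psi_\infty}\Big)\frac{\psi}{\psi^\xi}\,dy - \partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big). \]

Proof : This is a simple computation (using the fact that $\psi^\xi_\infty = 1$)

\begin{align*}
\int_{\mathbb{R}} \partial_x\ln\Big(\frac{\psi}{\psi_\infty}\Big)\frac{\psi}{\psi^\xi}\,dy - \partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)
&= \int_{\mathbb{R}} \partial_x\ln\psi\,\frac{\psi}{\psi^\xi}\,dy - \int_{\mathbb{R}} \partial_x\ln\psi_\infty\,\frac{\psi}{\psi^\xi}\,dy - \partial_x\ln\psi^\xi, \\
&= \frac{\int_{\mathbb{R}}\partial_x\psi\,dy}{\psi^\xi} + \int_{\mathbb{R}} \partial_x(V - A)\,\frac{\psi}{\psi^\xi}\,dy - \partial_x\ln\psi^\xi, \\
&= A'_t - A'.
\end{align*}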







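We will also use the following two estimates:

Lemma 5 Let us assume [H2]-[H3]. Then, for all $t \ge 0$, for all $x \in \mathbb{T}$,

\[ |A'_t(x) - A'(x)| \le \|\partial_{x,y}V\|_{L^\infty}\sqrt{\frac{2}{\rho}\,e_m(t,x)}. \]

Proof : For any coupling measure $\pi \in \Pi(\mu_{t,x}, \mu_{\infty,x})$, it holds:

\begin{align*}
|A'_t(x) - A'(x)| &= \Big|\int_{\mathbb{R}\times\mathbb{R}} \big(\partial_x V(x,y) - \partial_x V(x,y')\big)\,\pi(dy,dy')\Big|, \\
&\le \|\partial_{x,y}V\|_{L^\infty}\int_{\mathbb{R}\times\mathbb{R}} |y - y'|\,\pi(dy,dy'), \\
&\le \|\partial_{x,y}V\|_{L^\infty}\sqrt{\int_{\mathbb{R}\times\mathbb{R}} |y - y'|^2\,\pi(dy,dy')}.
\end{align*}

Taking now the infimum over all $\pi \in \Pi(\mu_{t,x}, \mu_{\infty,x})$ and using [H3] together with Lemma 2, we obtain

\[ |A'_t(x) - A'(x)| \le \|\partial_{x,y}V\|_{L^\infty}\,W(\mu_{t,x}, \mu_{\infty,x}) \le \|\partial_{x,y}V\|_{L^\infty}\sqrt{\frac{2}{\rho}\,H(\mu_{t,x}|\mu_{\infty,x})}, \]

which concludes the proof.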

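Lemma 6 Let us assume [H3]. Then for all $t \ge 0$,

\[ E_m(t) \le \frac{1}{2\rho}\int_{\mathbb{T}}\int_{\mathbb{R}} \Big|\partial_y\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi\,dy\,dx. \]

Proof : Using [H3], it holds:

\[ E_m = \int_{\mathbb{T}} e_m\,\psi^\xi\,dx \le \int_{\mathbb{T}} \frac{1}{2\rho}\int_{\mathbb{R}} \Big|\partial_y\ln\Big(\frac{\psi/\psi^\xi}{\psi_\infty/\psi^\xi_\infty}\Big)\Big|^2\frac{\psi}{\psi^\xi}\,dy\;\psi^\xi\,dx, \]

which yields the result since $\psi^\xi/\psi^\xi_\infty$ does not depend on $y$.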







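We are now in position to prove the exponential convergence of $E_m(t)$ to zero stated in Theorem 1 (see Equation (21)).
Equation (31) on $\psi$ can be rewritten as:

\[ \partial_t\psi = \operatorname{div}\Big(\psi_\infty\nabla\Big(\frac{\psi}{\psi_\infty}\Big)\Big) + \partial_x\big((A' - A'_t)\,\psi\big). \]

Notice that the derivative $\frac{dE}{dt}$ can be obtained by multiplying this equation by $\ln\big(\frac{\psi}{\psi_\infty}\big)$ and integrating over $\mathbb{T}\times\mathbb{R}$. Thus, one obtains after some integrations by parts, using a Cauchy-Schwarz inequality (to prove that (33) is non positive) and Lemma 4 (used twice):

\begin{align*}
\frac{dE_m}{dt} &= \frac{dE}{dt} - \frac{dE_M}{dt}, \\
&= -\int_{\mathbb{T}}\int_{\mathbb{R}} \Big|\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi + \int_{\mathbb{T}}\int_{\mathbb{R}} (A'_t - A')\,\partial_x\ln\Big(\frac{\psi}{\psi_\infty}\Big)\psi + \int_{\mathbb{T}} \Big|\partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\Big|^2\psi^\xi, \\
&= -\int_{\mathbb{T}}\int_{\mathbb{R}} \Big|\partial_y\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi - \int_{\mathbb{T}} (A'_t - A')\,\partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\psi^\xi \\
&\quad - \int_{\mathbb{T}}\int_{\mathbb{R}} \Big|\partial_x\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi + \int_{\mathbb{T}} \Big|\int_{\mathbb{R}} \partial_x\ln\Big(\frac{\psi}{\psi_\infty}\Big)\frac{\psi}{\psi^\xi}\,dy\Big|^2\psi^\xi\,dx, \tag{33} \\
&\le -\int_{\mathbb{T}}\int_{\mathbb{R}} \Big|\partial_y\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi - \int_{\mathbb{T}} (A'_t - A')\,\partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\psi^\xi.
\end{align*}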



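We now use Lemmas 5 and 6:

\begin{align*}
\frac{dE_m}{dt} &\le -2\rho E_m + \|\partial_{x,y}V\|_{L^\infty}\int_{\mathbb{T}} \sqrt{\frac{2}{\rho}\,e_m}\;\Big|\partial_x\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\Big|\,\psi^\xi\,dx, \\
&\le -2\rho E_m + \|\partial_{x,y}V\|_{L^\infty}\sqrt{\frac{2}{\rho}E_m}\,\sqrt{I(\psi^\xi|\psi^\xi_\infty)}.
\end{align*}

Using [H4], we thus have:

\[ \frac{d\sqrt{E_m}}{dt} \le -\rho\sqrt{E_m} + \|\partial_{x,y}V\|_{L^\infty}\sqrt{\frac{I_0}{2\rho}}\,\exp(-rt), \]

from which we deduce (21).

Equation (23) is then easily obtained using Lemma 5.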
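For the reader's convenience, the passage from the differential inequality on $\sqrt{E_m}$ to an exponential decay can be spelled out as follows. This is a standard Gronwall-type argument written under the notation assumptions $y(t) = \sqrt{E_m(t)}$ and $c = \|\partial_{x,y}V\|_{L^\infty}\sqrt{I_0/(2\rho)}$, which are introduced here only for this sketch; the resulting constant is not claimed to be the one appearing in (21).

```latex
% Assumed notation: y(t) = \sqrt{E_m(t)}, c = \|\partial_{x,y}V\|_{L^\infty}\sqrt{I_0/(2\rho)}.
\begin{align*}
y'(t) \le -\rho\, y(t) + c\, e^{-rt}
  &\;\Longrightarrow\; \frac{d}{dt}\big(e^{\rho t} y(t)\big) \le c\, e^{(\rho - r)t}, \\
  &\;\Longrightarrow\; y(t) \le y(0)\, e^{-\rho t}
      + c\,\frac{e^{-rt} - e^{-\rho t}}{\rho - r}
      \qquad (\rho \neq r), \\
  &\;\Longrightarrow\; y(t) \le \Big(y(0) + \frac{c}{|\rho - r|}\Big)\, e^{-\min(\rho, r)\, t}.
\end{align*}
% In the degenerate case \rho = r, the same integration gives
% y(t) \le (y(0) + c\,t)\, e^{-\rho t}.
```

With $m = 1$ and $\beta = 1$, the rate $\min(\rho, r)$ obtained in this way agrees with the rate $\lambda = \beta^{-1}\min(\rho m^{-2}, r)$ of Theorem 1.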

3.2 Proof of Proposition 1 and Theorem 1 in the general case

We now present the proof of Proposition 1 and Theorem 1 in the more general setting of Section 2.2. The proof follows the same lines as in the simple case presented in Section 3.1 but with additional difficulties related to the geometry of the submanifolds $\Sigma_z$.

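We need the following result

Lemma 7 The derivative of $\psi^\xi$ with respect to the reaction coordinate value reads:

\[ \partial_z\psi^\xi(t,z) = \int_{\Sigma_z} \Big(\frac{\nabla\xi\cdot\nabla\psi(t,\cdot)}{|\nabla\xi|^2} + \operatorname{div}\Big(\frac{\nabla\xi}{|\nabla\xi|^2}\Big)\psi(t,\cdot)\Big)|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}. \]

Proof : For any smooth test function $g : \mathcal{M} \to \mathbb{R}$, we obtain (using the co-area formula (39) and an integration by parts):

\begin{align*}
\int_{\mathcal{M}} \psi^\xi(t,z)\,g'(z)\,dz &= \int_{\mathcal{D}} \psi(t,x)\,g'\circ\xi(x)\,dx, \\
&= \int_{\mathcal{D}} \psi(t,x)\,\nabla(g\circ\xi)\cdot\nabla\xi\,|\nabla\xi|^{-2}(x)\,dx, \\
&= -\int_{\mathcal{D}} \operatorname{div}\Big(\psi(t,\cdot)\frac{\nabla\xi}{|\nabla\xi|^2}\Big)\,g\circ\xi\,dx, \\
&= -\int_{\mathcal{M}}\int_{\Sigma_z} \operatorname{div}\Big(\psi(t,\cdot)\frac{\nabla\xi}{|\nabla\xi|^2}\Big)|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}\,g(z)\,dz,
\end{align*}

which yields the result. ◊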

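Using this lemma, it can be shown that $\psi^\xi$ satisfies a simple diffusion equation, which is Proposition 1.

Lemma 8 The density $\psi^\xi$ satisfies the following diffusion equation on $\mathcal{M}$:

\[ \partial_t\psi^\xi = \partial_z\big(W'\psi^\xi + \partial_z\psi^\xi\big). \tag{34} \]

Proof : For any smooth test function $g : \mathcal{M} \to \mathbb{R}$, we have (using the co-area formula (39), (14), an integration by parts and finally Lemma 7):

\begin{align*}
\frac{d}{dt}\int_{\mathcal{M}} \psi^\xi(t,z)\,g(z)\,dz &= \frac{d}{dt}\int_{\mathcal{D}} \psi\,g\circ\xi\,dx, \\
&= \int_{\mathcal{D}} \operatorname{div}\Big(|\nabla\xi|^{-2}\big(\nabla(V - A_t\circ\xi + W\circ\xi)\,\psi + \nabla\psi\big)\Big)\,g\circ\xi\,dx, \\
&= -\int_{\mathcal{D}} |\nabla\xi|^{-2}\big(\nabla(V - A_t\circ\xi + W\circ\xi)\,\psi + \nabla\psi\big)\cdot\nabla\xi\;g'\circ\xi\,dx, \\
&= -\int_{\mathcal{D}} |\nabla\xi|^{-2}\big(\nabla V\cdot\nabla\xi\,\psi + \nabla\psi\cdot\nabla\xi\big)\,g'\circ\xi\,dx \\
&\quad + \int_{\mathcal{D}} A'_t\circ\xi\;g'\circ\xi\;\psi\,dx - \int_{\mathcal{D}} W'\circ\xi\;g'\circ\xi\;\psi\,dx, \\
&= -\int_{\mathcal{M}}\int_{\Sigma_z} |\nabla\xi|^{-3}\big(\nabla V\cdot\nabla\xi\,\psi + \nabla\psi\cdot\nabla\xi\big)\,d\sigma_{\Sigma_z}\,g'(z)\,dz \\
&\quad + \int_{\mathcal{M}} A'_t(z)\,g'(z)\,\psi^\xi(z)\,dz - \int_{\mathcal{M}} W'(z)\,g'(z)\,\psi^\xi(z)\,dz, \\
&= -\int_{\mathcal{M}}\int_{\Sigma_z} \Big(|\nabla\xi|^{-2}\,\nabla\psi\cdot\nabla\xi + \operatorname{div}\big(\nabla\xi\,|\nabla\xi|^{-2}\big)\psi\Big)|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}\,g'(z)\,dz \\
&\quad - \int_{\mathcal{M}} W'(z)\,\psi^\xi(z)\,g'(z)\,dz, \\
&= -\int_{\mathcal{M}} \big(\partial_z\psi^\xi(t,z) + W'(z)\,\psi^\xi(t,z)\big)\,g'(z)\,dz,
\end{align*}

which is a weak formulation of (34). ◊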

As stated in Corollary 1, this result already yields the exponential convergence to zero of the macroscopic Fisher information $I(\psi^\xi|\psi^\xi_\infty)$ under adequate assumptions on $W$ (this is the matter of [H4'] and Lemma 13 below). We suppose in the following that [H4] is indeed satisfied.

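The equivalent of Lemma 4 writes

Lemma 9 The difference between the biasing force $A'_t$ and the mean force $A'$ can be expressed in terms of the densities as

\[ A'_t - A' = \int_{\Sigma_z} \frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\,\frac{\psi\,|\nabla\xi|^{-1}}{\psi^\xi}\,d\sigma_{\Sigma_z} - \partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big). \]

Proof : Using Lemma 7 and the definition of $\psi_\infty$ (recall that $\beta = 1$, so that $\psi_\infty$ is proportional to $\exp(-(V - A\circ\xi + W\circ\xi))$ and $\psi^\xi_\infty$ to $\exp(-W)$), it holds:

\begin{align*}
\int_{\Sigma_z} \frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\,\frac{\psi\,|\nabla\xi|^{-1}}{\psi^\xi}\,d\sigma_{\Sigma_z}
&= \frac{1}{\psi^\xi}\int_{\Sigma_z} \frac{\nabla\xi\cdot\nabla\psi}{|\nabla\xi|^2}\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z}
 + \frac{1}{\psi^\xi}\int_{\Sigma_z} \frac{\nabla\xi\cdot\nabla V}{|\nabla\xi|^2}\,\psi\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z} - A'(z) + W'(z), \\
&= \partial_z\ln\psi^\xi + \frac{1}{\psi^\xi}\int_{\Sigma_z} \Big(\frac{\nabla\xi\cdot\nabla V}{|\nabla\xi|^2} - \operatorname{div}\Big(\frac{\nabla\xi}{|\nabla\xi|^2}\Big)\Big)\psi\,|\nabla\xi|^{-1}\,d\sigma_{\Sigma_z} - A'(z) - \partial_z\ln\psi^\xi_\infty, \\
&= A'_t(z) - A'(z) + \partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big).
\end{align*}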



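The equivalent of Lemmas 5 and 6 write:

Lemma 10 Let us assume [H2]-[H3]. Then for all $t \ge 0$, for all $z \in \mathcal{M}$,

\[ |A'_t(z) - A'(z)| \le M\sqrt{\frac{2}{\rho}\,e_m(t,z)}. \]

Proof : For any coupling measure $\pi \in \Pi(\mu_{t,z}, \mu_{\infty,z})$ defined on $\Sigma_z \times \Sigma_z$, it holds:

\[ |A'_t(z) - A'(z)| = \Big|\int_{\Sigma_z\times\Sigma_z} \big(F(x) - F(x')\big)\,\pi(dx,dx')\Big| \le \|\nabla_{\Sigma_z}F\|_{L^\infty}\sqrt{\int_{\Sigma_z\times\Sigma_z} d_{\Sigma_z}(x,x')^2\,\pi(dx,dx')}. \]

Taking now the infimum over all $\pi \in \Pi(\mu_{t,z}, \mu_{\infty,z})$ and using [H2]-[H3] together with Lemma 2, we thus obtain

\[ |A'_t(z) - A'(z)| \le M\,W(\mu_{t,z}, \mu_{\infty,z}) \le M\sqrt{\frac{2}{\rho}\,H(\mu_{t,z}|\mu_{\infty,z})}, \]

which concludes the proof.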

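Lemma 11 Let us assume [H3]. Then for all $t \ge 0$,

\[ E_m(t) \le \frac{1}{2\rho}\int_{\mathcal{D}} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi. \tag{35} \]

Proof : Using [H3], it follows:

\[ E_m = \int_{\mathcal{M}} e_m\,\psi^\xi\,dz \le \int_{\mathcal{M}} \frac{1}{2\rho}\int_{\Sigma_z} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi(t,\cdot)}{\psi_\infty}\Big)\Big|^2\frac{\psi(t,\cdot)\,|\nabla\xi|^{-1}}{\psi^\xi(t,z)}\,d\sigma_{\Sigma_z}\;\psi^\xi(t,z)\,dz, \]

which yields the result, using the co-area formula (39).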





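We are now in position to prove the exponential convergence of $E_m(t)$ to zero stated in Theorem 1 (see Equation (21)). Equation (14) on $\psi$ can be rewritten as:

\[ \partial_t\psi = \operatorname{div}\Big(|\nabla\xi|^{-2}\,\psi_\infty\nabla\Big(\frac{\psi}{\psi_\infty}\Big)\Big) + \operatorname{div}\Big(|\nabla\xi|^{-2}\,\nabla\big((A - A_t)\circ\xi\big)\,\psi\Big). \]

Notice that the derivative $\frac{dE}{dt}$ can be obtained by multiplying this equation by $\ln\big(\frac{\psi}{\psi_\infty}\big)$ and integrating over $\mathcal{D}$. Thus, one obtains after some integrations by parts, using the co-area formula (39) and Lemma 9:

\begin{align*}
\frac{dE_m}{dt} &= \frac{dE}{dt} - \frac{dE_M}{dt}, \\
&= -\int_{\mathcal{D}} \Big|\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2|\nabla\xi|^{-2}\,\psi + \int_{\mathcal{D}} (A'_t - A')\circ\xi\;\frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\,\psi + \int_{\mathcal{M}} \Big|\partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\Big|^2\psi^\xi, \\
&= -\int_{\mathcal{D}} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2|\nabla\xi|^{-2}\,\psi - \int_{\mathcal{D}} \Big|\frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\Big|^2\psi \\
&\quad + \int_{\mathcal{M}} (A'_t - A')\Big(A'_t - A' + \partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\Big)\psi^\xi + \int_{\mathcal{M}} \Big|\partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\Big|^2\psi^\xi.
\end{align*}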



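Using the Cauchy-Schwarz inequality:

\[ \int_{\mathcal{M}} \Big|\int_{\Sigma_z} \frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\,\frac{\psi\,|\nabla\xi|^{-1}}{\psi^\xi}\,d\sigma_{\Sigma_z}\Big|^2\psi^\xi\,dz \le \int_{\mathcal{D}} \Big|\frac{\nabla\xi\cdot\nabla\ln(\psi/\psi_\infty)}{|\nabla\xi|^2}\Big|^2\psi, \]

and Lemma 9 again, we thus obtain

\[ \frac{dE_m}{dt} \le -\int_{\mathcal{D}} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2|\nabla\xi|^{-2}\,\psi - \int_{\mathcal{M}} (A'_t - A')\,\partial_z\ln\Big(\frac{\psi^\xi}{\psi^\xi_\infty}\Big)\psi^\xi. \]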



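We now use [H2], Lemmas 10 and 11:

\[ \frac{dE_m}{dt} \le -2\rho m^{-2}E_m + M\sqrt{\frac{2}{\rho}E_m}\,\sqrt{I(\psi^\xi|\psi^\xi_\infty)}. \]

Using [H4], we thus have:

\[ \frac{d\sqrt{E_m}}{dt} \le -\rho m^{-2}\sqrt{E_m} + M\sqrt{\frac{I_0}{2\rho}}\,\exp(-rt), \]

from which we deduce (21).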

Equation (23) is then easily obtained using Lemma 10.

3.3 Proof of Corollary 1

3.3.1 Convergence of the macroscopic Fisher information

Let us first show that in both cases considered in [H4'], the exponential convergence [H4] of the macroscopic Fisher information indeed holds.

Let us first consider the case $\mathcal{M} = \mathbb{T}$ and $W = 0$. We know from (34) that $\psi^\xi$ satisfies $\partial_t\psi^\xi = \partial_{z,z}\psi^\xi$ on $\mathbb{T}$, and we would like to show exponential convergence of the Fisher information $I(\psi^\xi(t,\cdot)|\psi^\xi_\infty)$.

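Lemma 12 (Convergence of the Fisher information when $\mathcal{M} = \mathbb{T}$ and $W = 0$)

Let $\phi$ be a function defined for $t \ge 0$ and $x \in \mathbb{T}$ which satisfies

\[ \partial_t\phi = \partial_{x,x}\phi \quad \text{on } \mathbb{T} \]

and such that $\int_{\mathbb{T}}\phi(0,\cdot) = 1$, $\phi(0,\cdot)$ is non negative, and $I(\phi(0,\cdot)|\phi_\infty) < \infty$, where $\phi_\infty = 1$ is the longtime limit of $\phi$. Then, $\forall t \ge 0$,

\[ I(\phi(t,\cdot)|\phi_\infty) \le I(\phi(0,\cdot)|\phi_\infty)\exp(-8\pi^2 t). \]

Proof : Let us denote $u = \sqrt{\phi}$. We notice that $I(\phi|\phi_\infty) = \int_{\mathbb{T}} |\partial_x\ln\phi|^2\phi = 4\int_{\mathbb{T}} |\partial_x u|^2$. Moreover, we have from (32)

\[ \partial_t u = \partial_{x,x}u + \frac{(\partial_x u)^2}{u}. \]

Therefore,

\begin{align*}
\frac{d}{dt}\int_{\mathbb{T}} (\partial_x u)^2 &= 2\int_{\mathbb{T}} \partial_{x,x,x}u\,\partial_x u + 2\int_{\mathbb{T}} \partial_x\Big(\frac{(\partial_x u)^2}{u}\Big)\partial_x u, \\
&= -2\int_{\mathbb{T}} (\partial_{x,x}u)^2 - 2\int_{\mathbb{T}} \frac{(\partial_x u)^2}{u}\,\partial_{x,x}u, \\
&= -2\int_{\mathbb{T}} (\partial_{x,x}u)^2 - 2\int_{\mathbb{T}} \frac{\partial_x\big((\partial_x u)^3\big)}{3u}, \\
&= -2\int_{\mathbb{T}} (\partial_{x,x}u)^2 - \frac{2}{3}\int_{\mathbb{T}} \frac{(\partial_x u)^4}{u^2}, \\
&\le -8\pi^2\int_{\mathbb{T}} (\partial_x u)^2,
\end{align*}

where we have used the Poincaré-Wirtinger inequality on $\mathbb{T}$, applied to $\partial_x u$: For any function $f \in H^1(\mathbb{T})$,

\[ \int_{\mathbb{T}} \Big(f - \int_{\mathbb{T}} f\Big)^2 \le \frac{1}{4\pi^2}\int_{\mathbb{T}} |\partial_x f|^2. \]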
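The decay estimate of Lemma 12 can be checked numerically on an example. The sketch below is only an illustration under stated assumptions: the initial density `phi0`, the grid size and the function names are hypothetical choices, the heat equation on the unit torus is solved exactly mode by mode with the FFT, and the Fisher information $I(\phi|1) = \int_{\mathbb{T}} |\partial_x\phi|^2/\phi$ is approximated by a uniform-grid quadrature.

```python
import numpy as np

def heat_solution(phi0, t):
    """Exact Fourier solution of d_t phi = d_xx phi on the unit torus at time t."""
    n = phi0.size
    k = np.fft.fftfreq(n, d=1.0 / n)  # integer wave numbers
    decay = np.exp(-4 * np.pi**2 * k**2 * t)
    return np.real(np.fft.ifft(np.fft.fft(phi0) * decay))

def fisher_information(phi):
    """Approximate I(phi | 1) = int |d_x phi|^2 / phi dx with a spectral derivative."""
    n = phi.size
    k = np.fft.fftfreq(n, d=1.0 / n)
    dphi = np.real(np.fft.ifft(2j * np.pi * k * np.fft.fft(phi)))
    return np.mean(dphi**2 / phi)  # uniform-grid quadrature on [0, 1)

# Assumed initial density: positive, smooth, with unit mass on the torus.
n = 256
x = np.arange(n) / n
phi0 = 1.0 + 0.5 * np.cos(2 * np.pi * x) + 0.3 * np.sin(4 * np.pi * x)

I0 = fisher_information(phi0)
for t in (0.01, 0.05, 0.1):
    It = fisher_information(heat_solution(phi0, t))
    bound = I0 * np.exp(-8 * np.pi**2 * t)
    print(f"t = {t:.2f}: I(t) = {It:.3e}, bound = {bound:.3e}")
```

The printed values of $I(\phi(t,\cdot)|\phi_\infty)$ are expected to stay below $I(\phi(0,\cdot)|\phi_\infty)\exp(-8\pi^2 t)$, in agreement with the lemma.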







Let us now consider the case $\mathcal{M} = \mathbb{R}$ and $W \neq 0$ which is such that $W''$ is bounded from below and $\psi^\xi_\infty = \frac{\exp(-W)}{\int_{\mathbb{R}}\exp(-W)}$ satisfies a logarithmic Sobolev inequality (as stated in [H4']). We know from (34) that $\psi^\xi$ satisfies $\partial_t\psi^\xi = \partial_z(W'\psi^\xi + \partial_z\psi^\xi)$ on $\mathbb{R}$, and we would like to show exponential convergence of the Fisher information $I(\psi^\xi(t,\cdot)|\psi^\xi_\infty)$.

Lemma 13 (Convergence of the Fisher information when $\mathcal{M} = \mathbb{R}$ and $W \neq 0$)

Let $\phi$ be a function defined for $t \ge 0$ and $x \in \mathbb{R}$ which satisfies

\[ \partial_t\phi = \partial_x\big(W'\phi + \partial_x\phi\big) \quad \text{on } \mathbb{R}, \]

and such that $\int_{\mathbb{R}}\phi(0,\cdot) = 1$, $\phi(0,\cdot)$ is non negative, and $I(\phi(0,\cdot)|\phi_\infty) < \infty$, where $\phi_\infty = \frac{\exp(-W)}{\int_{\mathbb{R}}\exp(-W)}$ is the longtime limit of $\phi$. Let us assume that $W''$ is bounded from below by a constant $\alpha$ and $\phi_\infty$ satisfies LSI($\bar r$), with $\bar r > 0$. We can suppose without loss of generality that

\[ \bar r \ge \alpha. \]

Then there exists $I_0 > 0$ and $r > 0$ such that $\forall t \ge 0$,

\[ I(\phi(t,\cdot)|\phi_\infty) \le I_0\exp(-2rt). \]

More precisely, when $\alpha = \bar r > 0$, it is possible to take $I_0 = I(\phi(0,\cdot)|\phi_\infty)$ and $r = \alpha$. When $\alpha < \bar r$, for any $\varepsilon \in (0,\bar r)$, it is possible to choose $r = \bar r - \varepsilon$ for a well-chosen constant $I_0 > 0$.

Proof : The fact that $\bar r \ge \alpha$ is clear since either $\alpha \le 0$, or $\alpha > 0$ in which case it is well-known that $\phi_\infty$ satisfies LSI($\alpha$) (see for example [2]), so that one can choose at least $\bar r = \alpha$.

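Let us recall the expression for the entropy $H(\phi(t,\cdot)|\phi_\infty) = \int_{\mathbb{R}} \ln(\phi/\phi_\infty)\,\phi$ and the Fisher information $I(\phi(t,\cdot)|\phi_\infty) = \int_{\mathbb{R}} |\partial_x\ln(\phi/\phi_\infty)|^2\phi$. Since $\phi_\infty$ satisfies LSI($\bar r$), we have

\[ H(\phi(t,\cdot)|\phi_\infty) \le \frac{1}{2\bar r}\,I(\phi(t,\cdot)|\phi_\infty). \]

Moreover, by standard computations (see for example [3]), we have

\[ \frac{d}{dt}H(\phi(t,\cdot)|\phi_\infty) = -I(\phi(t,\cdot)|\phi_\infty) \]

and

\[ \frac{d}{dt}I(\phi(t,\cdot)|\phi_\infty) = -2\int_{\mathbb{R}} \Big|\partial_{x,x}\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2\phi - 2\int_{\mathbb{R}} \Big|\partial_x\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2 W''\,\phi. \tag{36} \]

If $\alpha = \bar r$, we thus obtain from (36) that $\frac{d}{dt}I(\phi(t,\cdot)|\phi_\infty) \le -2\alpha\,I(\phi(t,\cdot)|\phi_\infty)$ which concludes the proof in this case.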

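Let us now suppose that $\alpha < \bar r$. The technique of proof we propose is taken from [11]. For any $\lambda > 0$, we have

\begin{align*}
\frac{d}{dt}\big(H(\phi(t,\cdot)|\phi_\infty) + \lambda I(\phi(t,\cdot)|\phi_\infty)\big)
&= -\int_{\mathbb{R}} \Big|\partial_x\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2\phi - 2\lambda\int_{\mathbb{R}} \Big|\partial_{x,x}\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2\phi - 2\lambda\int_{\mathbb{R}} \Big|\partial_x\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2 W''\,\phi, \\
&\le -\int_{\mathbb{R}} \Big|\partial_x\ln\Big(\frac{\phi}{\phi_\infty}\Big)\Big|^2(1 + 2\lambda W'')\,\phi, \\
&\le -(1 + 2\lambda\inf W'')\,I(\phi(t,\cdot)|\phi_\infty), \\
&\le -\frac{1 + 2\alpha\lambda}{\lambda + 1/(2\bar r)}\big(H(\phi(t,\cdot)|\phi_\infty) + \lambda I(\phi(t,\cdot)|\phi_\infty)\big).
\end{align*}

We thus obtain that, for any $\lambda > 0$,

\[ H(\phi(t,\cdot)|\phi_\infty) + \lambda I(\phi(t,\cdot)|\phi_\infty) \le \big(H(\phi(0,\cdot)|\phi_\infty) + \lambda I(\phi(0,\cdot)|\phi_\infty)\big)\exp\Big(-\frac{1 + 2\alpha\lambda}{\lambda + 1/(2\bar r)}\,t\Big) \]

and therefore

\[ I(\phi(t,\cdot)|\phi_\infty) \le \big(\lambda^{-1}H(\phi(0,\cdot)|\phi_\infty) + I(\phi(0,\cdot)|\phi_\infty)\big)\exp\Big(-\frac{1 + 2\alpha\lambda}{\lambda + 1/(2\bar r)}\,t\Big). \]

Since $\frac{1 + 2\alpha\lambda}{\lambda + 1/(2\bar r)}$ goes to $2\bar r$ when $\lambda$ goes to 0, for any $\varepsilon \in (0,\bar r)$, one can find a $\lambda > 0$ such that $\frac{1 + 2\alpha\lambda}{\lambda + 1/(2\bar r)} = 2(\bar r - \varepsilon)$, which concludes the proof.

3.3.2 Convergence of the biasing force

Let us now prove the convergence result (24) for the biasing force.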

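In the case $\mathcal{M} = \mathbb{T}$ (and thus $W = 0$), we can prove the convergence of $\|A'_t - A'\|_{L^2(\mathbb{T})}$ to zero in the following sense (which implies (24), using (21)): for any $\varepsilon \in (0,1)$, $\forall t \ge t_\varepsilon$,

\[ \|A'_t - A'\|^2_{L^2(\mathbb{T})} \le \frac{2}{1-\varepsilon}\,\frac{M^2}{\rho}\,E_m(t), \tag{37} \]

where $t_\varepsilon = \max\Big(0,\ (4\pi^2)^{-1}\ln\Big(\varepsilon^{-1}\big(\int_{\mathbb{T}} (\partial_z\psi^\xi(0,\cdot))^2\big)^{1/2}\Big)\Big)$. This is obtained using the fact that $\int_{\mathbb{T}} (\partial_x\psi^\xi(t,\cdot))^2 \le \int_{\mathbb{T}} (\partial_x\psi^\xi(0,\cdot))^2\exp(-8\pi^2 t)$ (the proof of this estimate is similar to the one of Lemma 12) and the fact that for any function $f \in H^1(\mathbb{T})$,

\[ \Big\|f - \int_{\mathbb{T}} f\Big\|^2_{L^\infty} \le \int_{\mathbb{T}} |\partial_x f|^2, \]

applied to $f = \psi^\xi$. Thus we have $\|\psi^\xi - 1\|^2_{L^\infty} \le \int_{\mathbb{T}} (\partial_z\psi^\xi(0,\cdot))^2\exp(-8\pi^2 t)$ which implies that for $t \ge t_\varepsilon$, $\psi^\xi(t,\cdot) \ge 1 - \varepsilon$ which yields (37) from (23).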

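Let us now prove (24) in the case $\mathcal{M} = \mathbb{R}$, under assumption [H4'] on $W$. Let us introduce a compact $K \subset \mathbb{R}$. Since $H^1(K) \subset L^\infty(K)$ (with continuous injection), there exists $c > 0$ such that

\[ \Big\|\frac{\psi^\xi}{\psi^\xi_\infty} - 1\Big\|_{L^\infty(K)} \le c\,\Big\|\frac{\psi^\xi}{\psi^\xi_\infty} - 1\Big\|_{H^1(K)}. \]

Thus, for any $\varepsilon \in (0,\bar r)$, there exists $C > 0$ such that

\[ \Big\|\frac{\psi^\xi}{\psi^\xi_\infty} - 1\Big\|_{L^\infty(K)} \le C\exp(-rt), \]

with $r = \bar r - \varepsilon$. This inequality is obtained from the fact that since $\psi^\xi_\infty$ satisfies LSI($\bar r$), then $\psi^\xi_\infty$ also satisfies a Poincaré inequality with the same constant $\bar r$ (see for example [2]), and a proof similar to that of Lemma 13 for the convergence of the Fisher information $\int_{\mathbb{R}} \big|\partial_z\big(\frac{\psi^\xi}{\psi^\xi_\infty}\big)\big|^2\psi^\xi_\infty$ associated with the Poincaré inequality.
Now, we write

\begin{align*}
\int_K |A'_t - A'|^2\,\psi^\xi_\infty &= \int_K |A'_t - A'|^2\,\psi^\xi - \int_K |A'_t - A'|^2\Big(\frac{\psi^\xi}{\psi^\xi_\infty} - 1\Big)\psi^\xi_\infty, \\
&\le \int_K |A'_t - A'|^2\,\psi^\xi + C\exp(-rt)\int_K |A'_t - A'|^2\,\psi^\xi_\infty.
\end{align*}

Thus, for $t$ sufficiently large, $\int_K |A'_t - A'|^2\,\psi^\xi_\infty$ is bounded from above by some constant times $\int_{\mathbb{R}} |A'_t - A'|^2\,\psi^\xi$, which yields (24) (using (23) and (21)).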

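3.4 Proof of Theorem 2

Let us now prove Theorem 2. We still assume, up to a change of variable, that $\beta = 1$. We have:

\[ \frac{dE}{dt} \le -\int_{\mathcal{D}} \Big|\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi + \sqrt{\int_{\mathcal{M}} |A'_t - A'|^2\,\psi^\xi}\;\sqrt{\int_{\mathcal{D}} \Big|\nabla\xi\cdot\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi}. \]

Since, by Lemmas 10 and 11,

\[ \int_{\mathcal{M}} |A'_t - A'|^2\,\psi^\xi \le \frac{M^2}{\rho^2}\int_{\mathcal{D}} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi, \]

we thus obtain

\begin{align*}
\frac{dE}{dt} &\le -\int_{\mathcal{D}} \Big|\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi + \frac{Mm}{\rho}\sqrt{\int_{\mathcal{D}} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi}\;\sqrt{\int_{\mathcal{D}} \Big|\frac{\nabla\xi}{|\nabla\xi|}\cdot\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi}, \\
&\le -\Big(1 - \frac{Mm}{2\rho}\Big)\int_{\mathcal{D}} \Big|\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi,
\end{align*}

where we have used the fact that, for any function $f : \mathcal{D} \to \mathbb{R}$, $|\nabla f|^2 = |\nabla_{\Sigma_z}f|^2 + \big|\frac{\nabla\xi}{|\nabla\xi|}\cdot\nabla f\big|^2$. The logarithmic Sobolev inequality with respect to $\psi_\infty$ (see [H5]) concludes the proof.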



A The co-area formula 

The aim of this section is to state the co-area formula for a function $\xi : \mathcal{D} \to \mathbb{R}^p$
(where $1 \le p \le n$) such that $\operatorname{rank}(\nabla\xi) = p$. Classical proofs for the co-area formula
can be found in the books [1, 8]. These proofs are however quite involved since they
assume only Lipschitz-regularity for $\xi$. The proof is simpler in the case of a smooth $\xi$:
it can be done by an adequate parameterization and a simple change of variables.

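Lemma 14 (co-area formula) For any smooth function $\phi : \mathbb{R}^n \to \mathbb{R}$,

\[ \int_{\mathbb{R}^n} \phi(x)\sqrt{\det G(x)}\,dx = \int_{\mathbb{R}^p}\int_{\Sigma_z} \phi\,d\sigma_{\Sigma_z}\,dz, \tag{38} \]

where $G$ is a $p \times p$ matrix with $G_{i,j} = \nabla\xi_i\cdot\nabla\xi_j$. In the case $p = 1$, Equation (38) reads:

\[ \int_{\mathbb{R}^n} \phi(x)\,|\nabla\xi|(x)\,dx = \int_{\mathbb{R}}\int_{\Sigma_z} \phi\,d\sigma_{\Sigma_z}\,dz. \tag{39} \]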
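As a sanity check of (39), the two sides can be compared numerically on a simple example. The sketch below rests on assumptions made only for illustration: $n = 2$, the reaction coordinate $\xi(x,y) = x^2 + y^2$ (whose gradient vanishes at the origin, a set of measure zero), the test function $\phi(x,y) = \exp(-(x^2 + 2y^2))$, and elementary midpoint quadratures with hypothetical resolution parameters.

```python
import numpy as np

def phi(x, y):
    """Assumed smooth, rapidly decaying test function."""
    return np.exp(-(x**2 + 2.0 * y**2))

def lhs_volume_integral(L=6.0, n=1200):
    """Midpoint rule for int phi |grad xi| dx over [-L, L]^2, with xi = x^2 + y^2."""
    h = 2 * L / n
    pts = -L + h * (np.arange(n) + 0.5)
    X, Y = np.meshgrid(pts, pts, indexing="ij")
    grad_norm = 2.0 * np.sqrt(X**2 + Y**2)  # |grad xi|
    return np.sum(phi(X, Y) * grad_norm) * h * h

def rhs_level_set_integral(z_max=36.0, nz=20000, ntheta=200):
    """Integrate over z the surface integral of phi on the circle xi = z."""
    hz = z_max / nz
    z = hz * (np.arange(nz) + 0.5)
    theta = 2 * np.pi * np.arange(ntheta) / ntheta
    R = np.sqrt(z)[:, None]  # radius of the level set Sigma_z
    vals = phi(R * np.cos(theta)[None, :], R * np.sin(theta)[None, :])
    # Surface measure on a circle of radius R is R d(theta).
    surface = np.sum(vals, axis=1) * (2 * np.pi / ntheta) * np.sqrt(z)
    return np.sum(surface) * hz

lhs = lhs_volume_integral()
rhs = rhs_level_set_integral()
print(lhs, rhs)
```

Both numbers should agree up to the quadrature error, which reflects the change of variables $z = r^2$ behind the formula in this radial setting.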



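Remark 12 This formula shows that if the random variable $X$ has law $\psi(x)\,dx$ in $\mathbb{R}^n$, then $\xi(X)$ has law

\[ \int_{\Sigma_z} \psi\,(\det G)^{-1/2}\,d\sigma_{\Sigma_z}\;dz, \]

and the law of $X$ conditioned to a fixed value $z$ of $\xi(X)$ is

\[ d\mu_{\Sigma_z} = \frac{\psi\,(\det G)^{-1/2}\,d\sigma_{\Sigma_z}}{\int_{\Sigma_z} \psi\,(\det G)^{-1/2}\,d\sigma_{\Sigma_z}}. \]

Indeed, for any bounded functions $f$ and $g$,

\begin{align*}
\mathbb{E}\big(f(\xi(X))\,g(X)\big) &= \int_{\mathbb{R}^n} f(\xi(x))\,g(x)\,\psi(x)\,dx, \\
&= \int_{\mathbb{R}^p}\int_{\Sigma_z} f\circ\xi\;g\,\psi\,(\det G)^{-1/2}\,d\sigma_{\Sigma_z}\,dz.
\end{align*}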

The measure $(\det G)^{-1/2}\,d\sigma_{\Sigma_z}$ is sometimes denoted by $\delta_{\xi(x)-z}(dx)$ in the literature.

B Another possible set of assumptions for the convergence of the adaptive dynamics (10)-(11)

It is also possible to state a result similar to Theorem 1 for the dynamics (10)-(11) under slightly different assumptions than [H2] and [H3] by introducing another Riemannian structure on $\Sigma_z$ (see [15]) than that induced by the scalar product of the ambient space $\mathcal{D}$. Let us introduce the following scalar product: $\forall x \in \Sigma_z$, $\forall u,v \in T_x\Sigma_z$,

\[ \langle u, v\rangle_{\Sigma_z} = |\nabla\xi|^2(x)\;u\cdot v, \tag{40} \]

where $\cdot$ denotes as before the scalar product of the ambient space $\mathcal{D}$, and the associated norm: $\forall x \in \Sigma_z$, $\forall u \in T_x\Sigma_z$,

\[ |u|_{\Sigma_z} = |\nabla\xi|(x)\,|u|. \]

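Accordingly, the definition of the surface gradient is modified as follows¹ (compare with the definition used in the main text): For $f : \mathcal{D} \to \mathbb{R}$,

\[ \nabla_{\Sigma_z}f = |\nabla\xi|^{-2}\,P\nabla f. \tag{41} \]

In particular, we have $|\nabla_{\Sigma_z}f|_{\Sigma_z} = |\nabla\xi|^{-1}\,|P\nabla f|$.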

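In this case, the Fisher information between the conditioned measures $\mu_{t,z}$ and $\mu_{\infty,z}$ is:

\[ I(\mu_{t,z}|\mu_{\infty,z}) = \int_{\Sigma_z} \Big|\nabla_{\Sigma_z}\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2_{\Sigma_z}\,d\mu_{t,z} = \int_{\Sigma_z} |\nabla\xi|^{-2}\,\Big|P\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\,d\mu_{t,z}, \]

and the assumption [H3] is stated in terms of this new Fisher information:

[H3'] $V$ and $\xi$ are such that $\exists\rho > 0$, for all $z \in \mathcal{M}$, the conditional measure $\mu_{\infty,z}$ satisfies LSI($\rho$), $\Sigma_z$ being endowed with the Riemannian structure (40).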

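Using this Fisher information, Lemma 11 writes:

\[ E_m(t) \le \frac{1}{2\rho}\int_{\mathcal{D}} |\nabla\xi|^{-2}\,\Big|P\nabla\ln\Big(\frac{\psi}{\psi_\infty}\Big)\Big|^2\psi. \]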



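The definition for the Wasserstein distance is now stated using the geodesic distance $d_{\Sigma_z}$: $\forall x,y \in \Sigma_z$,

\[ d_{\Sigma_z}(x,y) = \inf\Big\{\sqrt{\int_0^1 |\dot w(t)|^2_{\Sigma_z}\,dt}\ \Big|\ w \in C^1([0,1],\Sigma_z),\ w(0) = x,\ w(1) = y\Big\}. \]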



Thus, the estimate of Lemma 10 is changed to:

\[ |A'_t(z) - A'(z)| = \Big|\int_{\Sigma_z\times\Sigma_z} \big(F(x) - F(x')\big)\,\pi(dx,dx')\Big| \le \big\|\,|\nabla\xi|^{-1}\,|P\nabla F|\,\big\|_{L^\infty}\sqrt{\int_{\Sigma_z\times\Sigma_z} d_{\Sigma_z}(x,x')^2\,\pi(dx,dx')}, \]

where $F$ is defined by (6). Notice that

\[ |\nabla\xi|^{-1}\,|P\nabla F| = |\nabla_{\Sigma_z}F|_{\Sigma_z}. \]

Thus, assumption [H2] is modified as:

[H2'] $V$ and $\xi$ are sufficiently differentiable functions such that $\big\|\,|\nabla_{\Sigma_z}F|_{\Sigma_z}\,\big\|_{L^\infty} \le M < \infty$.

The rest of the proof remains the same, and exponential convergence is thus obtained, assumptions [H2] and [H3] being respectively replaced by [H2'] and [H3']. With this set of assumptions, the rate of convergence is $\lambda = \beta^{-1}\min(\rho, r)$.

¹With a slight abuse of notation, we still use the same notation $\nabla_{\Sigma_z}$ to denote the surface gradient, or $I(\mu_{t,z}|\mu_{\infty,z})$ to denote the Fisher information, or $d_{\Sigma_z}$ to denote the geodesic distance, or $\rho$ to denote the microscopic rate of convergence, while these are not the same as in the rest of the paper, since the Riemannian structure has been changed.